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55 ( 54 > Title: IMMUNOGLOBULIN BINDING PROTEIN ARRAYS IN PLANT CELLS* 
r-H 

(57) Abstract: Arrays of immunoglobulin binding proteins in plants or plant cells are provided. Such arrays comprise plants or plant 
Q cells transformed with polynucleotides encoding multiple different immunoglobulin binding proteins, or polypeptide components 
^ thereof. Methods are further provided for genetic segregation of the transformation events such that each transformant in an array is 
^ capable of producing progeny capable of expressing one or more immunoglobulin binding proteins, including multi-subunit proteins. 
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IMMUNOGLOBULIN BINDING PROTEIN ARRAYS IN PLA NT CELLS 

Technical Field 



25 



30 



The present invention relates generally to arrays of immunoglobulin 
binding proteins. The invention is more particularly related to methods for the 
expression of arrays of foreign immunoglobulin binding proteins in eukaryotic cells 
such as plant cells, as well as to transformed eukaryotic cells that express such arrays. ' 

Background of the Invent,™ 

Immunoglobulin molecules play key roles in a variety of physiological 
processes. Such molecules, which include antibodies and portions thereof, are critical 
for nnmune system function, and have found numerous therapeutic and diagnostic 
applications. The discovery of immunoglobulin molecules with desired binding 
characteristics is the focus of many current drug discovery efforts. 

Traditional techniques for immunoglobulin molecule discovery involve 
the expression of a multitude of immunoglobulin molecule genes in an array of 
hybndoma cells, other forms of immortalized B-lymphocytes or phage-infected 
bactena. For monoclonal antibody expression, individual antibody-producing B- 
lymphocytes from an immunized animal are generally fused with cells derived from an 
immortalized B-lymphocyte tumor. Clones of hybrid cells are then screened to identify 
those that grow indefinitely and secrete the desired immunoglobulin molecule The 
polynucleotides encoding the monoclonal antibodies can then be isolated and used to 
express all or part of the antibody in other organisms, such as bacteria, yeast and plants 
The ability to express immunoglobulins on the surface of bacteriophage has enabled the 
generation of immunoglobulin libraries that could represent all possible combinations 
of Heavy and light chains derived from any population of B-lymphocytes These 
hbranes have been used successfully to identify high affinity combining sites 
recogmzmg a wide variety of antigens. A significant drawback to this technique is the 
randomization of heavy and light chain combining sites requiring the generation of very 
large numbers of recombinant phage to identify specific heavy and light chain binding 
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pairs. This combinatorial aspect of random libraries makes expression of these libraries 
in other organisms unfeasible. Newer technologies involve transgenic mice expressing 
antibodies from human chromosomal segments which can be used to generate 
hybridoma arrays expressing human antibodies. 
5 Arrays formed in B-lymphocytes, phage infected bacteria or transgenic 

animals have been useful within certain immunoglobulin molecule screens, but 
difficulties have been encountered with producing large quantities of immunoglobulin 
molecules in these cells. Large-scale production of immunoglobulin molecules from 
any of the traditional organisms is typically very expensive. Further, phage infected 
10 bacteria are incapable of providing the variety of immunoglobulin molecule structures 
that may be desired. Similarly, the usefulness of transgenic animal cells has been 
limited by the susceptibility of such cells to infection with viruses or other 
microorganisms. 

For economic and other reasons, it would be desirable to use genetically 

15 engineered plants as the primary vehicle for the discovery of immunoglobulin 
molecules, as well as for the ultimate production of immunoglobulin molecules to be 
used in industrial, clinical or research applications. The advantages of plants for 
production of immunoglobulin molecules include a low cost of production, relatively 
low capital investment compared to fermentation systems, the absence of animal 

20 viruses and prions, production of the immunoglobulin molecule in a biochemical 
background of defined proteins such as seed proteins, ease of storage and transport, and 
a facile scale-up to unlimited quantities of raw material. It would also be desirable to 
be able to express a library of binding proteins that is not derived from a combinatorial 
process of randomly paired heavy and light chains. 

25 It is known that immunoglobulin molecules can be expressed in a variety 

of eukaryotic hosts including plant cells. A wide variety of structural genes have been 
isolated from mammalian cells and viruses, joined to transcriptional and translational 
initiation and termination regulatory signals from a source other than the structural 
gene, and introduced into plant hosts in which these regulatory signals are functional. 

30 Among those host cells that have been transformed with individual immunoglobulin 
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molecule-encoding nucleic acids are monocots (e.g., corn, rice and wheat), dicots (e g. 
tobacco, soybean, alfalfa, petunia, and Arabidopsis) and lower plants (eg. 
Chlamydomonasy Plants transformed with nucleic acids encoding individual 
ammunoglobulin molecules have been able to produce folly functional and fully 
5 assembled immunoglobulins (see Hiatt et al., Nature 342:16-1%, 1989; Firek et al 
Plant Molecular Biology 23:861-870, 1993; Van Engelen et al., Plant Molecular 
Biology 2*1701-1710, 1994; Ma et al., Science 268:116-119, 1995; Magnuson et al 
Protein Expression and Purification 7:220-228, 1996; Schouten et al., Plant Molecular 
Biology 30:181-193, 1996; Fiedler et al., Immunotechnology 5:205-216, 1997- Verch et 
10 al., J. Immunol Meth. 220:69-15, 1998; Zeitlin et al., Nature Biotechnology^ 6:1361- 
1364, 1998; DeJaeger et al., Eur. J. Biochem. 259:426-434, 1999; Fischer et al Biol 
Chem. 380:825-839, 1999; Khoudi et al., Biotechnology and Engineering 64(2):135- 
143, 1999; McCormick et al., Proc. Natl Acad. Sci. USA 96:103-108, 1999; Russell, 
Curr. Top. Microbiol. Immunol 240:119-138, 1999). 
15 In previous plant cell transformations, the transforming nucleic acid 

introduced a single immunoglobulin molecule. For example, tobacco plants have been 
transformed with individual gamma or kappa chains to produce individual plants 
expressmg immunoglobulin molecule components. The respective tobacco 
transformants were then cross-pollinated to produce plants expressing a single 
20 anybody, wherein covalent bond formation between the two components resulted in the 
formanon of enhanced binding capacity. In another instance, an antibody molecule was 
introduced into a single plant using a single vector. The vector encoded two 
nnmunoglobulin component chains and resulted in the formation, in the plant of an 
mununoglobulin molecule comprising covalently linked heavy and light 
25 immunoglobulin chains. & 

Plant cells have not been used to express a diversity of immunoglobulin 
molecules in an array. As noted above, the ability to prepare an array of 
immunoglobulin molecules in plants or plant cells would facilitate identification of 
useful nnmunoglobulin molecules and would enable a rapid transition from 
30 nnmunoglobulin molecule discovery to full scale production in a single organism 
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Accordingly, there remains a need in the art for methods for generating 
arrays of immunoglobulin molecules in plants and plant cells, as well as other 
eukaryotic organisms and cells. The present invention fulfills these needs and further 
provides other related advantages. 

5 

Summary of the Invention 

Briefly stated, the present invention provides methods for the production 
of arrays of biologically or physiologically active immunoglobulin binding proteins in 
eukaryotic cells. Within certain aspects, methods are provided for preparing an 

10 immunoglobulin binding protein array in plant cells, comprising the steps of: (a) 
transforming a population of plant cells with a library of at least two different 
polynucleotides encoding different immunoglobulin binding protein (IgBP) 
polypeptides that: (i) specifically bind to a ligand with a K D <10" 6 moles/liter; or (ii) 
form one or more disulfide bonds with one or more polypeptides in the transfected cell, 

15 to generate a binding protein that specifically binds to a ligand with a K D <10" 6 
moles/liter; wherein the IgBP polypeptides (i) comprise four framework regions (e.g., 
human or murine) alternating with three complementarity determining regions and (ii) 
comprise at least one peptide sequence having at least 75%, preferably at least 95%, 
sequence identity to a framework region of a native IgM, IgG, IgA, IgD, IgE, IgY, 

20 kappa or lambda immunoglobulin molecule; and wherein the IgBP polypeptides are not 
detectably expressed by the plant cells prior to transformation; and (b) selecting 
transformed plant cells, and therefrom preparing an IgBP array in plant cells. Each 
IgBP polypeptide may be a functional IgBP or an IgBP component (e.g., a portion of an 
immunoglobulin molecule selected from the group consisting of heavy chains and 

25 fragments thereof, light chains and fragments thereof, J chains and secretory 
components) that, upon disulfide linkage to one or more IgBP components encoded by 
other polynucleotides in the library, forms a functional IgBP. Within certain specific 
embodiments, a library employed in such methods commprises at least 10, 100, 1,000 
or 10,000 different polynucleotides. 
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Within certain embodiments, such methods further comprise the step of: 
(c) growing the transformed plant cells on a growth medium that supports replication of 
the plant cells, such that functional IgBPs are assembled by the plant cells. Within 
other specific embodiments, such methods further comprise the steps of: (c) growing 
5 the transformed plant cells on a growth medium to form plants; and (d) sexually 
crossing the plants with themselves or other plants to generate progeny, such that the 
progeny comprise polynucleotides encoding IgBP components sufficient to form a 
functional IgBP. Such progeny may be seeds, or may be plants or plant cells that 
assemble functional IgBPs, and the IgBP polypeptides may, but need not, be secreted 
1 0 from the plant cells. 

The present invention further provides, within other aspects, methods for 
prepanng a heavy chain binding protein array in eukaryotic cells (e.g., plant, insect or 
mammalian cells), comprising the steps of: (a) transforming a population of eukaryotic 
cells wdh a library of at least two different polynucleotides, wherein each 
polynucleotide encodes a different heavy chain binding protein (C H BP) polypeptide 
that: (,) compnses an amino acid sequence that is at least 75% identical to a constant 
regxon tailpiece of a mu or alpha chain of a native immunoglobulin heavy chain- (ii) 
compnses multiple combining sites, wherein all of the combining sites satisfy the same 
one of the following requirements: (1) at least 75% identity to a 25 consecutive amino 
10 acid portion of an immunoglobulin light chain variable region or (2) at least 75% 
identxty to a 25 consecutive amino acid portion of an immunoglobulin heavy chain 
vanable region; and (iii) either (1) specifically binds to a ligand with a K D <10< 
moles/liter; or (2) forms one or more disulfide bonds with one or more polypeptides in 
the transfected cell, to generate a C H BP that specifically binds to a ligand with a K D <1CT 
5 moles/liter; and (b) growing the transformed cells on a medium that permits assembly 
of C H BPs, wherein each C H BP compnses at least four combining sites; and therefrom 
prepanng a C H BP array in eukaryotic cells. The polynucleotides may, for example 
encode immunoglobulin alpha or mu chains. Within certain embodiments, the cells are' 
further transformed with one or more polynucleotides encoding polypeptides having 
sequences that are at least 75% identical to a sequence of an immunoglobulin J chain 
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Resulting C H BPs may be assembled, for example, from four alpha chains and one J 
chain, from twelve mu chains and/or from ten mu chain and at least one J chain. C H BPs 
or components thereof may, but need not, further comprise one or more portions of 
immunoglobulin molecules selected from the group consisting of J chains, secretory 
5 components and light chain constant regions. The C H BPs may accumulate in an 
intracellular compartment of the cells or may be secreted from the cells. 

Within further aspects, methods are provided for preparing a heavy 
chain binding protein array in eukaryotic cells, comprising the steps of: (a) exposing 
multiple copies of a polynucleotide encoding a native heavy chain to a mutagen, such 

10 that random or site-directed mutagenesis of the polynucleotide occurs, resulting in a 
library of heavy chain variants; (b) transforming a population of eukaryotic cells with 
the library of heavy chain variants; and (c) growing the transformed cells on a medium 
that permits assembly of C H BPs, wherein each C H BP comprises at least four combining 
sites; and therefrom preparing a C H BP array in eukaryotic cells. 

15 Methods are further provided for preparing a plant C H BP array, 

comprising the steps of: (a) transforming a population of plant cells with a library of at 
least two different polynucleotides, wherein each polynucleotide encodes a different 
C H BP component that forms one or more disulfide bonds with one or more 
polypeptides in the transformed cell to generate a C H BP that specifically binds to a 

20 ligand with a K D <10" 6 moles/liter, wherein each component: (i) comprises an amino 
acid sequence that is at least 75%, preferably at least 95%, identical to a constant region 
tailpiece of a mu or alpha chain of a native immunoglobulin heavy chain; and (ii) 
comprises multiple combining sites, wherein all of the combining sites satisfy the same 
one of the following requirements: (1) at least 75% identity to a 25 consecutive amino 

25 acid portion of an immunoglobulin light chain variable region or (2) at least 75% 
identity to a 25 consecutive amino acid portion of an immunoglobulin heavy chain 
variable region; (b) growing the transformed plant cells on a growth medium to form 
plants; and (c) sexually crossing the plants to generate progeny, such that the progeny 
comprise polynucleotides encoding C H BP components sufficient to form a functional 

30 C H BP that comprises at least four combining sites; and therefrom preparing a plant 
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C H BP array. The progeny may be seeds, or may be plants or plant cells that assemble 
functional C H BP, Within certain specific embodiments, a library employed in such 
methods commprises at least 10, 100, 1,000 or 10,000 different polynucleotides The 
C H BPs may accumulate in an intracellular compartment of the cells or may be secreted 
from the cells. 

Within further aspects, the present invention provides C„BP arrays in 
eukaryotic cells, comprising a, leas, two eukaryotic ceHs („, plant , insect or 
mammalian cells) that are each transfotmed with a different pofynac.eofide encoding a, 
least one CBP polypeptide mat: (a) comprises an amino acid sequence fha, fs a, leas. 
75% .dentical to a constant region taiipicce of a nut or alpha chain of a native 
trnmunoglobutin heavy chain; (b) comprises multiple combining si.es, wherein all of 
tire combining si.es satisfy tire same one of the following tegmenta: (i) a. leas, 75% 
tdentny to a 25 consecutive amino acid portion of an inrmunogfobufin light chain 
vanable region or (r) a, feas, 75% identity ,„ a 25 consecutive amino acid portion of an 
.mmunogtobufin heavy chafn vanable region; (c) either (f) specificaffy bfnds to a figand 
w.«h a K D < 10 -« moles/liter; or (if) forms one or mote covalen. bonds with one or more 
polypeptides in the transfected ceff, ,o generate a C„BP that specificaffy binds ,o a 
hgand with a K B <f0* mofes/liter; and (d) differs in amino acfd sequence from other 
C„BPs m dte atray; wherein tire eefls assemble C„BPs comprising a, leas, four 
combining sites. The po.ynucleotides may, for exampfc, encode pofypepride 
components of immunoglobuiin mofecules fndependentfy sefected from fhe group 
consisting of heavy chains and fragments thereof, light chains and fragments .hereof J 
chams and secrecy components. Within certafn specific embodiments, me cefls fn me 
an-ay assembfe a, feas, !0, fOO, f,000 or ,0,000 different pofynucfeofides. Afso within 
certiun embodfments, each ceff in such an array may be ,ransfec.ed wfth a, feas, rivo 
dtfferen, polynucieotides, each encodfng a different C H BP componenf, such fha, each 
cell assembles a ftrnctionaf C H BP comprising the C H BP components. 

Within further aspects, tire present invention provides compositions 
compnsing an array of encapsufated C H BPs, wherein each C„BP: (a) comprises an 
ammo acid sequence ma, is a, feas, 75% identica. ,o a consten, region tailpiece of a mu 
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or alpha chain of a native immunoglobulin heavy chain; (b) comprises at least four 
combining sites, wherein all of the combining sites satisfy the same one of the 
following requirements: (i) at least 75% identity to a 25 consecutive amino acid 
portion of an immunoglobulin light chain variable region; or (ii) at least 75% identity to 
5 a 25 consecutive amino acid portion of an immunoglobulin heavy chain variable 
region; and (c) either (i) specifically binds to a ligand with a K D <10~ 6 moles/liter; or (ii) 
forms one or more covalent bonds with one or more polypeptides in a cell, to generate a 
C H BP that specifically binds to a ligand with a K D <1 0" 6 moles/liter; and (d) differs in 
amino acid sequence from other C H BPs in the array. 

10 Methods are further provided for preparing a heavy chain binding 

protein array in eukaryotic cells, comprising the steps of: (a) exposing multiple copies 
of a polynucleotide encoding a native heavy chain to a mutagen, such that random or 
site-directed mutagenesis of the polynucleotide occurs, resulting in a library of heavy 
chain variants; (b) transforming a population of eukaryotic cells with the library of 

15 heavy chain variants; and (c) growing the transformed cells on a medium that permits 
assembly of C H BPs, wherein each C H BP comprises at least four combining sites; and 
therefrom preparing a C H BP array in eukaryotic cells. 

Within further aspects, the present invention provides C H BPs that: (a) 
comprise an amino acid sequence that is at least 75% identical to a constant region 

20 tailpiece of a mu or alpha chain of a native immunoglobulin heavy chain; (b) comprise 
at least four combining sites, wherein all of the combining sites satisfy the same one of 
the following requirements: (i) at least 75% identity to a 25 consecutive amino acid 
portion of an immunoglobulin light chain variable region; or (ii) at least 75% identity to 
a 25 consecutive amino acid portion of an immunoglobulin heavy chain variable 

25 region; and (c) either (i) specifically bind to a ligand with a K^IO" 6 moles/liter; or (ii) 
form one or more covalent bonds with one or more polypeptides in a cell, to generate a 
C H BP that specifically binds to a ligand with a K D <10" 6 moles/liter. 

These and other aspects of the present invention will become apparent 
upon reference to the following detailed description and attached drawings. All 
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references disclosed herein are hereby incorporated by reference in their entirety as if 
each was incorporated individually. 

Brief Description of the Drawing 

Figures 1 A- IB is a table depicting representative framework region (FR) 
sequences for human and mouse heavy and light chains, as indicated (SEQ ID Nos: 74- 
197). 



Detailed Description of the Invention 

As noted above, the present invention is directed to arrays of 
immunoglobulin binding protein (IgBP) polypeptides in eukaryotic organisms or cells, 
such as plants and plant cells. Each cell within such an array is transformed with at 
least one polynucleotide encoding one or more IgBPs or polypeptide components 
thereof (either integrated within the nuclear genome or resident in the cytoplasm). 
Within certain embodiments, each cell comprises polynucleotides encoding multiple 
polypeptide components of one or more IgBPs, such that each cell is capable of 
assembling at least one functional (i.e., biologically active) IgBP. Transformed cells 
may further comprise biologically active IgBPs, although in some cases (e.g., seeds 
generated from transformed plants), the functional IgBP may not be generated. 
Transgenic arrays embodying the present invention are useful for the discovery of novel 
IgBPs. Within certain embodiments, IgBPs within arrays provided herein are not 
derived from a combinatorial process of randomly paired heavy and light chains. It has 
been found, within the context of the present invention, that plants and other eukaryotic 
cells are capable of assembling IgBPs comprising any variable region (from light or 
heavy chain) expressed as specific heavy chain isotypes. 

In certain embodiments, IgBP arrays may be generated in high yields in 
transformed, sexually reproducible plants. Transgenic plants provided herein are 
generally morphologically normal, but for the presence of one or more copies of the 
foreign polynucleotides in some or all of their cells. The respective gene products can 
be present in substantially all or a portion of the plant cells (e.g., the products can be 
localized to a cell type, tissue or organ). From the transgenic arrays, individual 
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organisms or clones of cells can be identified rapidly, enabling the easy access to an 
economical, high yield process for the large scale production of a desired IgBP. 

Glossary 

5 Prior to setting forth the present invention in detail, definitions of certain 

terms used herein are provided. 

Immunoglobulin binding protein (IgBP): An immunoglobulin binding 
protein (i) comprises an amino acid sequence that is at least 75% identical to at least 
one framework region of a native immunoglobulin molecule (e.g., IgM, IgG, IgA, IgD, 

1 0 IgE, IgY kappa or lambda) and (ii) is a functional binding protein. Framework regions 
are described below, under "Immunoglobulins." A protein P is a functional binding 
protein if (1) for one molecular, ionic or atomic ligand A the K D (P, A)<10' 6 moles/liter 
(preferably <10 7 moles/liter), where K D (X,Y) = [X][Y]/[X:Y], and (2) for a different 
molecular, ionic or atomic species B, K D (P, B^IO" 4 moles/liter. Such a protein P is 

15 said to specifically bind A. Immunoglobulin binding proteins (IgBPs) generally 
function as a binding protein by virtue of the properties of a sequence of amino acids 
comprising a combining site, as defined below. An IgBP may comprise a single 
immunoglobulin chain or fragment thereof, multiple identical immunoglobulin chains 
or fragments thereof, or multiple non-identical immunoglobulin chains or fragments 

20 thereof. IgBPs include, for example, single chain antigen binding proteins, Fabs and 
Fvs. Also included are heavy chain binding proteins (C H BPs), discussed in greater 
detail below. 

Component of an IgBP: a polypeptide capable of forming one or more 
covalent bonds (preferably disulfide bonds) with one or more other polypeptides to 

25 generate a functional binding protein. A component is not itself a functional binding 
protein. For example, a multimeric antibody is considered an IgBP, and the 
polypeptide chains that are joined by covalent bonds to form an antigen binding site are 
considered to be IgBP components. Examples of such components include but are not 
limited to heavy chains and fragments thereof, light chains and fragments thereof, J 

30 chain and fragments thereof, and secretory component and fragments thereof. 
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IgBP polypeptide: encompasses both functional IgBPs and IgBP 

components. 

Immunoglobulin heavy chain binding protein (C H BP): an IgBP that (i) 
comprises multiple combining sites derived from (i.e., at least 75% identical to at least 
25 consecutive amino acids of) either immunoglobulin light chain or heavy chain 
variable regions, but not both; and (ii) comprises a native heavy chain constant region 
sequence, or a fragment or other variant thereof, provided that the amino acid sequence 
of such a component is at least 75% identical to a constant region tailpiece (defined 
below) of a mu or alpha chain of a native immunoglobulin heavy chain. A C H BP that 
comprises combining sites derived from one or more heavy chain variable regions does 
not comprise a combining site derived from a light chain variable region. Similarly, a 
C H BP that comprises combining sites derived from one or more light chain variable 
regions does not comprise a combining site derived from a heavy chain variable region. 
Multiple C H BP components may be covalently linked to generate a functional C„BP, or 
a single polypeptide may be sufficient. Representative C„BPs include proteins 
assembled from four alpha chains and one J chain, from twelve mu chains or from ten 
mu chains and at least on J chain. 

Heavy chain binding protein (C H BP) component: a polypeptide that is 
capable of forming one or more covalent bonds (preferably disulfide bonds) with one or 
more other polypeptides to generate a functional C H BP. The component is not itself a 
binding protein. Examples of such components include, but are not limited to, heavy 
chains and fragments thereof, and J chain and fragments thereof. 

Heavy chain binding protein (C H BP) polypeptide: includes both C„BPs 
and C H BP components. 

Immunoglobulin: any of the structurally related proteins or 
glycoproteins that function as antibodies. Polypeptides can be determined to comprise 
an immunoglobulin sequence based on sequence homology to known heavy chains, 
lights chains, J chains and related immunoglobulin sequences (see, e.g., Kabat et al., 
Sequences of Immunological Interest, National Institutes of Health, Bethesda, Md. 
1991). In general, an immunoglobulin sequence should be at least 95% identical to a 



WO 01/83806 



PCT/US01/14349 



12 

known immunoglobulin sequence over at least 50 consecutive amino acid residues of a 
constant region, if the immunoglobulin contains a constant region, and 75% identical to 
at least 25 consecutive amino acids of a variable region. Immunoglobulins may 
comprise multiple immunoglobulin components. Examples of such components 
5 include, but are not limited to, heavy chains and fragments thereof, light chains and 
fragments thereof, J chain and fragments thereof, and secretory component and 
fragments thereof. An immunoglobulin is generally identified by its binding specificity 
for a unique epitope. 

( 

Immunoglobulins are composed of the linear combination of a basic 
10 domain structure. Each domain contains two beta-pleated sheets, one beta sheet 
consisting of four beta strands, the other consisting of three beta strands. The two beta 
sheets are covalently linked by a disulfide bond. Antibody variable regions contain 
three sequences termed complementarity determining regions (CDR) within which are 
amino acid sequences of high variability when comparing numerous variable region 
15 sequences. Flanking each CDR are sequences of lesser variability termed framework 
regions (FR), of which there are four. The positions of the CDRs primarily coincide 
with the loops between beta strands, and conversely the FRs correlate with the beta 
strands themselves of the basic domain structure. For example, CDR1 (closest to the 
amino terminus of the immunoglobulin polypeptide) lies between beta strands 4-2 and 
20 3-1, and CDR2 is between 3-1 and 4-4. The three stranded beta sheets of variable 
regions are the contact areas between the light chain and heavy chain variable regions. 
The following table illustrates the typical structure of variable regions. 

Table 1 

25 Amino Acid Residues Associated with Framework Regions and Complementarity 
Determining Regions of Immunoglobulin Light and Heavy Chain Variable Domains 



Segment 


Light chain amino acids 


Heavy chain amino acids 


FR1 


1-23 


1-30 


CDR1 


24-34 


31-35 


FR2 


35-49 


36-49 
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B, an4 etas nray be found, for exan,p,e, in Kaba, e, a!,, Sequences of Inununofcgica, 
Interest, Nanonal Institutes of Health, Bethesda, Md. 1 991 . 

CDRs -n. T °V Sh ™ SKP ™ CaM ™ mn ^ n ™ tor ™™^^ 
CDRs. The spec.no sequences of individual CDRs can be found in Kaba, e, a! 

^uences of tanunologica, Interesl , National ^ Qf ^ ' 
1991- General rules for identify^ a CDR are as foUows: (■) CDR1 (ligh , chain)- 
star, ts approxnnately residue 24; residue before is a,ways a Cys; residue after is always 
a Trp TRP-TYR-GLN, bu, also, TRP-LEU-GLN, TRP-PHE-GLN, TRP- 

TVR-LEU); .engrh ,0 to ,7 residues; (2) CDR 2 (light chain); star, is always 16 
res-dues after ft. end of CDR ,; residues before generally ILE-TYR, bu, also, VAL- 
TVR, ILE-LVS, ILE -PHE; ,en g ft afways 2 residues (excep, 7FAB which has a dderion 
■n ftns regron); (3) CDR 3 (Ugh, chain); star, is always 33 residues after end of CDR 2 
(excep, 7FAB which has fc deletion „ fte end 0 f CDR-L2); residue before is always 
Cys; rescues after always PHE-OLY-XXX-GLY (SEQ ,D NO: „ (4) CDR , (heavy 

^ xT^:;;r due 26 (a,ways 4 **• a c ys); "-*- — »• ««■ 

XXX-XXX-XXX (SEQ n> NO: 2) . resjdue ^ ^ ^ 

also, T*P, LE , TRP-ALA; (5) CDR 2 (heavy chain): shu, is afways ,5 residues' 
*er fte end of CDR residues before „ LEU-GLU-TRP-ILE-GLY (SEQ ,D 
X bU * *" ~ * <* Mannas; residues ^ LYS/ARG- 
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LEU/TLE/V AL/PHE/THR/ALA-THR/SER/ILE/ALA; (6) CDR 3 (heavy chain): start is 
always 33 residues after end of CDR 2 (always 2 after a CYS); residues before always 
CYS-XXX-XXX (typically CYS-ALA-ARG); residues after always TRP-GLY-XXX- 
GLY (SEQ ID NO: 4). 

Table 2 



CDR consensus sequences 

Light chain sequences. 



Type 


CDR1 


CDR 2 


CDR 3 


Human kappa I 


RASQSLVSISSYLA 
(SEQ ID NO: 5) 


AASSLES 
(SEQ ID NO: 6) 


QQYNSLPEWT 
(SEQ ID NO: 7) 


Human kappa II 


RSSQSLLHSDGDTYLN 
(SEQ ID NO: 8) 


LVSNRAS 
(SEQ ID NO: 9) 


MQALQPRT 
(SEQ ID NO: 10) 


Human kappa III 


RASQSVSSSYLA 
(SEQ ID NO: 11) 


GASSRAT 
(SEQ ID NO: 12) 


QQYGSSPPLT 
(SEQ ID NO: 13) 


Human kappa IV 


KSSQSVLYSSNNKNYLA 
(SEQ ID NO: 14) 


WASTRES 
(SEQ ID NO: 15) 


QQYYSTPT 
(SEQ ID NO: 16) 


Human lambda I 


SGSSSNIIGNNYVS 
(SEQ ID NO: 17) 


DNNKRPS 
(SEQ ID NO: 18) 


ATWDDSLSANSAPV 
(SEQ ID NO: 19) 


Human lambda II 


TGTSSDVGGYNAVS 
(SEQ ID NO: 20) 


DVTDRPS 
(SEQ ID NO: 21) 


SSYGGGSNV 
(SEQ IDNO:22) 


Human lambda III 


SGDNLGDKYVH 
(SEQ ID NO: 23) 


DDNKRPS 
(SEQ ID NO:24) 


QAWDSSSDHPGVV 
(SEQ ID NO:25) 


Mouse kappa I 


KSSQSLLNSGNQKNYLA 
(SEQ ID NO: 26) 


WASTRES 
(SEQ ID NO:27) 


QNDYSYPLT 
(SEQ ID NO: 28) 


Mouse kappa II 


RSSQSLVHSNGNTYLE 
(SEQ ID NO: 29) 


KVSNRFS 
(SEQIDNO:30) 


FQGTHVPPYT 
(SEQ ID NO: 31) 


Mouse kappa III 


RASESVDSYGNSFMH 
(SEQ ID NO: 32) 


AASNLES 
(SEQ ID NO: 33) 


QQSNEDPPWT 
(SEQ ID NO: 34) 


Mouse kappa IV 


SASSSVSSSYLH 
(SEQ ID NO: 35) 


RTSNLAS \ 
(SEQ ID NO: 36) 


QQWSSYPGLT 
(SEQ ID NO: 37) 


Mouse kappa V 


RASQDDISNYLN 
(SEQ ID NO: 38) 


YASRLHS 
(SEQ ID NO: 39) 


QQGNTLPPRT 
(SEQ ID NO: 40) 


Mouse kappa IV 


SASSSVSYMH 
(SEQ ID NO: 41) 


DTSKLAS 
(SEQ ID NO: 42) 


QQWSSNPPMPLT 
(SEQ ID NO: 43) 
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Heavy chain sequences. 



Type 



CDR1 



Human heavy I 



Human heavy II 



Human heavy III 



Mouse heavy I 



Mouse heavy II 



SYAIS 

(SEQ ID NO: 44) 



SYGWSWN 
(SEQ ID NO: 47) 



SYAMS 

(SEQ ID NO: 50) 



SGYWNNS 
(SEQ ID NO: 53) 



DYYMNN 
(SEQ ID NO: 56) 



CDR 2 



WINPYGNGDTNYAQKFQG 
(SEQ ID NO: 45) 



RJYYRAYSGSTTYNPSLKS 
(SEQ ID NO: 48) 



VISGKTDGGSTYYADSVKG 
(SEQ ID NO: 51) 



YISGYSGSTYYNPSLKS 
(SEQ ID NO: 54) 



DINPGNGGTSYNQKFKG 
(SEQ ID NO: 57) 




CDR 3 



APG YGSGGGCYRGDYFDY 
(SEQ ID NO: 46) 



ELPGGYTGDDYYYGSGFDV 
(SEQ ID NO: 49) 



GRPGDSLSGYYYYYHYFDY 
(SEQ ID NO: 52) 



GGYGYGYYYYDYYYYFDY 
(SEQ ID NO: 55) 



GSYYSSSYMAYYAFDY 
(SEQ ID NO: 58) 



10 



15 



Immunoglobulin superfamily molecule: a molecule that has a domain 
size and amino acid residue sequence that is significantly similar to immunoglobulin or 
immunoglobulin related domains. The significance of similarity may be determined 
statistically as discussed above. A sequence identity of greater than ISVo with the 
immunoglobulins listed in Kabat et al. (Sequences of Immunological Interest, National 
Institutes of Health, Bethesda, Md. 1991) indicates a member of the immunoglobulin 
superfamily. Also, a typical Align score of less than 3 indicates that the molecule being 
tested is a member of the immunoglobulin gene superfamily. 

Combining site: a portion of a binding protein made up of those amino 
acid residues that contact the ligand or antigen by ionic interactions, hydrogen bonding, 
Van der Waals interaction or hydrophobic interaction. A combining site in a typical 
binding protein is located in a variable region of light and/or heavy chains and, more 
specifically, in a CDR of the fight and/or heavy chains. A combining site need not 
comprise more than one polypeptide. A combining site generally comprises four FR 
and three CDR sequences, which alternate as displayed in Table 1. The precise amino * 
acids present within a combining site may generally be determined by x-ray diffiaction 
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analysis of the binding protein with bound ligand or antigen. See, for example, Amit et 
al., Science 233:4765, 747-53, 1986. 

Immunoglobulin constant region: a portion of an immunoglobulin 
polypeptide that follows the carboxy terminus of the variable region. This is usually in 
5 the vicinity of amino acid #108 in light chains and amino acid #1 14 in heavy chains. 
Constant regions determine the isotype designation of the immunoglobulin and include 
but are not limited to kappa or lambda light chain constant regions and gamma, mu, 
alpha, epsilon and delta heavy chain constant regions. Constant regions of heavy 
chains are divided into domains. The first domain following the variable region is 

10 designated CHI. Domains following CHI include the hinge region, CH2, CH3 and 
possibly CH4 and membrane spanning domain. Examples of immunoglobulin constant 
regions can be found in Kabat et al., Sequences of Immunological Interest, National 
Institutes of Health, Bethesda, Md. 1991. 

Tailpiece of a constant region: tailpiece regions are located after the 

1 5 CH3 or CH4 segments of IgA or IgM constant regions, respectively. Examples of 
tailpieces are listed in Table 3. Other tailpiece regions may be identified based on 
similarity in sequences to one of the representative heavy chain tailpiece regions in 
Table 3. In general, a tailpiece should be at least 50% identical to a sequence in Table 3 
and always contains a cysteine as the penultimate carboxy terminus amino acid. When 

20 co-expressed in appropriate eukaryotic cells containing an endomembrane system, 
tailpiece regions attached to either alpha or mu constant regions are capable of forming 
a disulfide bond between the penultimate cysteine of the tailpiece and a cysteine in J 
chain. Formation of these disulfide bonds can result in the polymerization of alpha or 
mu constant regions. 

25 

Table 3 



Representative Heavy Chain Tailpiece Regions 



Isotype 


Species 


Sequence 


IgA 


Human 


PTHVNVSWMAEVDGTCY 
(SEQ ID NO: 62) 
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IgA 


Mouse 


PTVNVIMSEGDGICY 


IgA 


Mouse 


PTNVSVSVIMSEGDGICY 


IgA 


Rat 


PTNVNVSVIMSEGDGICY 


IgA 


Rabbit 


PTHVNVSVVVADVEAVCY 

(RVCi TD TsJfV 
^OJ-yVc 11-/ 1NU. ODJ 


IgM 


Human 


PTLYNVSLVMSDTAGTCY 


IgM 


Human 


PTLYNVSLIMSDTGGTCY 

/CCA TT"\ "MW /^O^ 


IgM 


Hamster 


PTLYNVSLIMSDAGGTCY 


IgM 


Hamster 


PTLYNVSLVLSDTAGZCY 
(SEQ ID NO- 70) 


IgM 


Rat 


PTLYNVSLIMSDTASTCY 
(SEQ ID NO: 71) 


IgM 


Chicken 


PSFVNVSLVLMDTVNSCN 
(SEQ ID NO: 72) 



J chain: a polypeptide that is substantially identical {i.e., at least 80% 
identical) in sequence to a J chain of Kabat et al. (Sequences of Immunological Interest, 
National Institutes of Health, Bethesda, Md. 1991). J chains are further capable of 
forming disulfide bonds to the penultimate cysteine residues in the tailpiece region of 
alpha or mu constant regions, thereby forming a polymeric structure, such as a dimeric 
IgA or a pentameric IgM. Disulfide bonds between J chain and alpha or mu heavy 
chain constant regions are generally formed in the endomembrane system of a cell 
during the process of secretion. The presence of disulfide bonds can be measured by 
comparison of the polypeptides in the presence or absence of an appropriate reducing 
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agent such as dithiothreitol or mercaptoethanol. Comparative analysis of peptides can 
be accomplished by, for example, denaturing gel electrophoresis using SDS and 
polyacrylamide as described in Molecular Cloning: A Laboratory Manual, Sambrook et 
al., eds., Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. (1989) and 
5 Antibodies: A Laboratory Manual, Harlow and Lane, eds., Cold Spring Harbor, N.Y. 
(1988). 

Alpha chain: a polypeptide that is substantially identical (i.e., at least 
90% identical) in sequence to the constant region of an alpha chain of Kabat et al. 
(Sequences of Immunological Interest, National Institutes of Health, Bethesda, Md. 
10 1991). 

Mu chain: a polypeptide that is substantially identical (i.e., at least 90% 
identical) in sequence to the constant region of a mu chain of Kabat et al. (Sequences of 
Immunological Interest, National Institutes of Health, Bethesda, Md. 1991). 

Secretory component: a polypeptide that binds to polymeric 

15 immunoglobulins containing J chains. Secretory components are derived from the 
polyimmunoglobulin receptor. The sequences of some secretory components and 
polyimmunoglobulin receptors have been determined. Other polypeptide that share at 
least 75% sequence identity with a known secretory component, and that retain the 
ability to bind to polymeric immunoglobulins containing J chains, are also considered 

20 to be secretory components. 

Heavy chain: a polypeptide that comprises an amino acid sequence that 
is at least 90% identical to the constant region of a native heavy chain sequence and an 
amino acid sequence that is at least 75% identical to the variable region of a native 
heavy chain sequence (see Kabat et al., Sequences of Immunological Interest, National 

25 Institutes of Health, Bethesda, Md. 1991) and that, when co-expressed with a light 
chain in the endomembrane system of an appropriate eukaryotic cell is capable of 
forming a heavy chain-light chain complex, potentially an antibody, joined by disulfide 
bonding. Heavy chains may be identified as the larger of the two polypeptides present 
within a divalent antibody, Fab, Fab'2 or Fv. Heavy chains are also found as 

30 components of polyvalent antibodies such as IgAs and IgMs. 
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Light chain: a polypeptide that is at least 90% identical to the constant 
region of a native light chain sequence and 75% identical to the variable region of a 
native light chain sequence (see Kabat et aL, Sequences of Immunological Interest, 
National Institutes of Health, Bethesda, Md. 1991) and that when co-expressed with a 
5 heavy chain in the endomembrane system of an appropriate eukaryotic cell is capable of 
forming a heavy chain-light chain complex, potentially an antibody, joined by disulfide 
bonding. Light chains may be identified as the smaller of the two polypeptides present 
within a divalent antibody, Fab, Fab'2 or Fv. Light chains are also found as 
components of polyvalent antibodies such as IgAs and IgMs. 
10 Light chain or heavy chain variable region: a region of a light chain or 

heavy chains that contains three complementarity determining regions (CDR), flanked 
by four framework regions (FR), as discussed above. A variable region generally has 
two antiparallel p-pleated sheets, one of which consists of four p* strands, and one of 
which consists of three p strands. In an antibody the three-stranded sheet is the contact 
15 area between heavy chain and light chain variable regions. The two p-pleated sheets 
are covalently linked by a disulfide bond (the half cystines that form this bond are 
conserved). The CDRs coincide with the loops between p strands, while the FRs 
correlate with the p strands themselves. An IgBP that does not comprise a light chain 
variable region generally does not contain a polypeptide that contains CDRs or FRs but 
20 may contain the constant region of a light chain. 

IgBP array: a population of eukaryotic cells or organisms (e.g., plants) 
that are transformed with different polynucleotides, each of which encodes a different 
IgBP or polypeptide component thereof. Each cell or organism is transformed with at 
least one such polynucleotide, and preferably at least two such polynucleotides, such 
25 that the array comprises at least two organisms or cells that are transformed with 
different polynucleotides. In general, the IgBP polypeptides encoded by the 
polynucleotides should not be detectably expressed by untransformed cells or 
organisms. An array may be displayed on a growth surface that allows for the 
replication of the cells or organisms. An array may be present, for example, in plants or 
30 plant cells, or may be stored in seed form. 
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Dicotyledon (dicot): A flowering plant whose embryos have two seed 
halves or cotyledons. Examples of dicots are: tobacco; tomato; the legumes including 
alfalfa; oaks; maples; roses; mints; squashes; daisies; walnuts; cacti; violets; and 
buttercups. 

5 DNA: Deoxyribonucleic acid. 

Epitope: A portion of a molecule that is specifically recognized by an 
immunoglobulin. It is also referred to as the determinant or antigenic determinant. 

Eukaryotic hybrid vector: A DNA molecule by means of which DNA 
coding for a polypeptide (insert) can be introduced into a eukaryotic cell. 
10 Fab fragment: A polypeptide consisting of a portion of an antibody 

molecule containing the active portions of an antibody heavy chain and an antibody 
light chain covalently coupled together and capable of specifically combining with 
antigen. Fab fragments are typically prepared by proteolytic digestion of substantially 
intact antibody molecules with papain using methods that are well known in the art. 
1 5 However an Fab fragment may also be prepared by expressing in a suitable host cell the 
desired portions of an antibody heavy chain and light chain using methods well known 
in the art. 

Fv fragment: A polypeptide consisting of the active portions of an 
antibody heavy chain variable region and an antibody light chain variable region 

20 covalently coupled together and capable of specifically combining with antigen. Fv 
fragments are typically prepared by expressing in a suitable host cell the desired 
portions of antibody heavy chain variable region and light chain variable region using 
methods well known in the art. 

Mutagenesis: A process whereby the nucleotide sequence of an original 

25 polynucleotide is changed in one or a few locations to produce derivative 
polynucleotides of substantially the same sequence. Mutagenesis can be accomplished 
by manipulation of polynucleotides in vitro by, for example, using various commonly 
available enzymes and mutagenic oligonucleotides. Mutagenesis can also be 
accomplished in vivo using the immune system of an animal to introduce desired 

30 changes in polynucleotides encoding immunoglobulins for example. B cell maturation, 
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for example, is a process whereby a mutated polynucleotide is derived from an original 
polynucleotide. & 

Insert: A DNA sequence foreign to the host, consisting of a structural 
gene and optionally additional DNA sequences. 

5 (Selective) Genetic marker: A DNA sequence coding for a phenotypical 

traxt by means of which transformed cells can be selected from untransformed cells. 

Leader sequence: A contiguous series of amino acids preceding a 
polypeptide. The leader sequence may be cleaved from the polypeptide during the 
process of secretion. 

Signal Sequence: A DNA sequence coding for an amino acid sequence 
attached .o the polypeptide which binds «he polypeptide <o the endoplasmic reticulum 
and is essential for protein secretion. 

Lower plant: Any non-flowering plant including ferns, gymnosperms 
comfers, horsetails, club mosses, liverworts, hornworts, mosses, red algae, brown algae 
gametophytes, sporophytes of pteridophytes, and green algae (e.g., Chlamydomonas). 

Monocotyledon (monocot): A flowering plant whose embryos have one 
cotyledon or seed leaf. Examples of monocots are: lilies; grasses; com; rice; grains 
including oats, wheat and barley; orchids; irises; onions and palms. 

Open reading frame: A sequence of nucleic acids when read . 
sequentially three at a time (triplets) contains no stop codon sequence, Stop codon 
sequences are UAG, UAA, and UGA. 

Paratope: An antigen binding site of an antibody molecule. 
Plant cell: A cell that depends for its growth on light and contains 

chloroplasts. 

Pollination: the transfer of pollen from male to female flower parts 
"Self-pollination" refers to the transfer of pollen from male flower parts to female 
flower parts on the same plant. This process typically produces seed from which 
progeny plants can be grown. "Cross-pollination" is the transfer of pollen from the 
male flower parts of one plant to the female flower parts of another plant. This process 
typically produces seed from which viable progeny can be grown. 
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Polypeptide and peptide: A series of amino acid residues covalently 
connected one to the other by peptide bonds between the alpha-amino and carboxy 
groups of adjacent residues or by disulfide bridges between two cysteines. 

Protein: A linear series of greater than about 50 amino acid residues 
5 connected one to the other as in a polypeptide. 

Promoter: A recognition site on a DNA sequence or group of DNA 
sequences that provide an expression control element for a gene and to which RNA 
polymerase specifically binds and initiates RNA synthesis (transcription) of that gene. 
An "inducible promoter" is a promoter where the rate of RNA polymerase binding and 

LQ initiation is modulated by external stimuli. Such stimuli include light, heat, anaerobic 
stress, alteration in nutrient conditions, presence or absence of a metabolite, presence of 
a ligand, microbial attack, wounding and the like. A "viral promoter" is a promoter 
with a DNA sequence substantially similar to the promoter found at the 5' end of a viral 
gene. A typical viral promoter is found at the 5 f end of the gene coding for the p21 

15 protein of MMTV described by Huang et al., Cell 27:245, 1981. A "synthetic 
promoter" is a promoter that was chemically synthesized rather than biologically 
derived. Usually synthetic promoters incorporate sequence changes that optimize the 
efficiency of RNA polymerase initiation. A "constitutive promoter" is a promoter 
where the rate of RNA polymerase binding and initiation is approximately constant and 

20 relatively independent of external stimuli. Examples of constitutive promoters include 
the cauliflower mosaic virus 35S and 19S promoters described by Poszkowski et al., 
EMBO J. .5:2719, 1989 and Odell et al., Nature 575:810, 1985. A "temporally 
regulated promoter" is a promoter where the rate of RNA polymerase binding and 
initiation is modulated at a specific time during development. Examples of temporally 

25 regulated promoters are given in Chua et al., Science 244:174-181, 1989. A "spatially 
regulated promoter" is a promoter where the rate of RNA polymerase binding and 
initiation is modulated in a specific structure of the organism such as the leaf, stem or 
root. Examples of spatially regulated promoters are given in Chua et al., Science 
244:174-181, 1989. A "spatiotemporally regulated promoter" is a promoter where the 

30 rate of RNA polymerase binding and initiation is modulated in a specific structure of 
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the organism at a specific time during development. A typical spatiotemporally 
regulated promoter is the EPSP synthase-35S promoter described by Chua et al., 
Science 244:174-181, 1989. 

RNA: Ribonucleic acid. 
5 rRNA: Ribosomal RNA. 

Secretion signal: A contiguous series of amino acids attached to or 
contained within a polypeptide which enables secretion of the polypeptide from a cell. 
The amino acids may or may not be cleaved from the polypeptide during the process of 
secretion. 

10 Seed: A viable dormant embryo enclosed in a testa that is derived from 

the integument(s). The embryo is associated with food reserve organs. 

Single-chain antigen-binding (SCAB) protein: A polypeptide composed 
of an immunoglobulin light-chain variable region amino acid sequence (VJ tethered to 
an immunoglobulin heavy-chain variable region amino acid sequence (V H ) by a peptide 
15 that links the carboxyl terminus of the V L sequence to the amino terminus of the V„ 
sequence or by a peptide that links the amino terminus of the V L sequence to the 
carboxyl terminus of the V H sequence. 

Single-chain antigen-binding (SCAB) protein-coding gene: A 
recombinant gene coding for a single-chain antigen-binding protein. 
20 Structural gene: A gene coding for a polypeptide and being equipped 

with a suitable promoter, termination sequence and optionally other regulatory DNA 
sequences, and having a correct reading frame. 

T-DNA: A segment of transferred DNA specific for plants. 
Ti-plasmid: Tumor-inducing plasmid specific for plants. 
25 Ti-DNA: A segment of DNA from Ti-plasmid. 

Transfection: A process whereby foreign DNA is introduced into a 
eukaryotic cell. The process may or may not result in cellular transformation. 

Transformation: A process whereby foreign DNA is introduced into the 
genome or cytoplasm of an organism resulting in the expression of a trait that did not 
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previously exist in the organism. Examples of added traits are resistance to toxic 
chemicals and expression of foreign proteins not normally produced by the organism. 



Immunoglobulin Binding Proteins (IgBP) Polypeptides 
5 As noted above, IgBPs comprise one or more polypeptides in which 

amino acids are linked by covalent peptide bonds. In general, an IgBP (i) comprises an 
amino acid sequence that is at least 75% identical to at least one framework region of a 
native immunoglobulin molecule {e.g., IgM, IgG, IgA, IgD, IgE, IgY, kappa or lambda) 
and (ii) is a functional binding protein. Sequence identity may be determined using any 

10 of a variety of well known algorithms, which may be readily optimized by those of 
ordinary skill in the art. One such algorithm is employed by the Align program 
described by Dayhoff et al., Meth. Enzymol 97:524-545, 1983. A functional binding 
protein, as discussed above, specifically binds to a ligand with a K D <10~ 6 moles/liter 
(preferably <10" 7 moles/liter). K D may be readily determined using well known assays. 

15 An IgBP may comprise a single immunoglobulin chain or fragment 

thereof, multiple identical immunoglobulin chains or fragments thereof, or multiple 
non-identical immunoglobulin chains or fragments thereof. IgBPs include, for 
example, single chain antigen binding proteins, Fabs and Fvs. Other IgBPs are heavy 
chain binding proteins (C H BPs), which comprise multiple combining sites composed of 

20 amino acid residues derived from the constant region of an immunoglobulin heavy 
chain and a variable region from any source (e.g., either heavy or light chain, but not 
both). In a preferred embodiment the variable region is derived from a heavy chain. 
C H BPs further comprise an amino acid sequence that is at least 75% identical to a 
constant region tailpiece of a mu or alpha chain of a native immunoglobulin heavy 

25 chain. A C H BP may further comprise one or more J chains, which can serve to link 
other component polypeptides. Representative C„BPs include proteins assembled from 
four alpha chains and one J chain, from twelve mu chains or from ten mu chains and at 
least on J chain. 

IgBPs may be made up of component polypeptides linked by covalent 
30 bonds, preferably disulfide bonds. Preferred components comprise one or more 
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portions of immunoglobulin molecules such as heavy chains and fragments thereof, 
light chains and fragments thereof, J chains and fragments thereof and secretory' 
components and fragments thereof. A component polypeptide may comprise a native 
immunoglobulin sequence or a variant of such a sequence. As noted above, an "IgBP 
polypeptide" is any polypeptide that is a functional IgBP or an IgBP component. 

For preparation of an array, a library of polynucleotides encoding IgBP 
polypeptides is generally employed. Such a library generally comprises at least two 
different polynucleotides (preferably at least 10, 100, 1,000 or 10,000 different 
polynucleotides), each of which encodes a different IgBP or component thereof. An 
IgBP is different from another IgBP if there are one or more differences between the 
amino acid sequences of the IgBPs. Preferably, such differences result in differences in 
the affinity of the IgBP for a respective substrate, ligand or epitope. Preferred libraries 
encode variants of an original polynucleotide. Such a library may be derived from the 
original polynucleotide by a process of mutagenesis that may occur in vitro {e.g., using 
commonly available enzymes and mutagenic oligonucleotides) or in vivo {e.g., by B 
cell maturation). Such a library generally comprises polynucleotides encoding variants 
that differ from an original immunoglobulin by one or more amino acid substitutions 
and/or deletions, such that each variant retains at least 75% identity, preferably at least 
95% identity, to the original immunoglobulin. Certain such libraries may contain 
polynucleotides encoding immunoglobulin variants that differ from the original 
immunoglobulin only in one or more point mutations. Polynucleotides in a library are 
preferably present within a vector that facilitates transformation of target eukaryotic 
cells, and subsequent expression of binding protein or component thereof in 
transformed cells. 

There are a variety of sources for nucleic acids encoding populations of 
immunoglobulin binding proteins. For example, immunoglobulins may be derived 
from the B cells of an immunized host. Such a population of immunoglobulin binding 
proteins is derived from a multitude of cells each expressing a different 
immunoglobulin binding protein. Populations of immunoglobulin binding proteins or 



WO 01/83806 



PCT/US01/14349 



26 

potential immunoglobulin binding proteins can also be derived from the mutagenesis of 
unique sequences of immunoglobulins. 

Methods for isolating polynucleotides encoding a population of IgBPs 
■ are well known in the art. See, for example, Molecular Cloning: A Laboratory Manual, 
5 Sambrook et al., eds., Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. (1989) 
and Antibodies: A Laboratory Manual, Harlow and Lane, eds., Cold Spring Harbor, 
N.Y. (1988). As noted above, particularly preferred polynucleotides encode 
immunoglobulin heavy and light chain variable regions, or portions thereof. Such 
polynucleotides may be isolated from cells obtained from a vertebrate, preferably a 

10 mammal, which has been immunized with an antigenic ligand (antigen) against which 
activity is sought (/.<?., a preselected antigen). The immunization can be carried out 
conventionally and antibody titer in the animal can be monitored to determine the stage 
of immunization that corresponds to the affinity or avidity desired. Partially immunized 
animals typically receive only one immunization and cells are collected therefrom 

15 shortly after a response is detected. Fully immunized animals display a peak titer that is 
achieved with one or more repeated injections of the antigen into the host mammal, 
normally at two to three week intervals. Usually three to five days after the last 
challenge, the spleen is removed and the genes coding for immunoglobulin heavy and 
immunoglobulin light chains are isolated from the rearranged B cells present in the 

20 spleen using standard procedures. See Current Protocols in Molecular Biology, 
Ausubel et aL, eds., John Wiley and Sons, New York (1987) and Antibodies: A 
Laboratory Manual, Harlow and Lane, eds., Cold Spring Harbor, N.Y. (1988). 

In addition to the spleen, rearranged B cells can be derived from the bone 
marrow of individuals who have been exposed to specific antigens. For example, 

25 patients in hospitals can be exposed to infectious organisms that they would not 
ordinarily encounter. Frequently, these patients mount an immune response that results 
in B cell maturation and deposition in bone marrow of B cells expressing antibodies 
that neutralize the pathogen. In addition to nosocomial exposure, memory B cells are 
deposited in response to a variety of infections, including HIV, HPV, HSV and CMV. 
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Genes en coding V H and V L polypeptides can be derived from cells 
producing IgA, IgD, IgE, IgG or IgM, and most preferably from IgG producing cells. 
Methods for preparing fragments of genomic DNA from which immunoglobulin 
variable region genes can be cloned are well known in the art. See for example, 
Herrmann et al., Methods in Enzymol. 752:180-183, 1987; Frischauf, Methods in 
Enzymol 752:183-190, 1987; and Frischauf, Methods in Enzymol. 752:199-212, 1987. 

Probes useful for isolating polynucleotides encoding immunoglobulin 
products include the sequences encoding constant portions of the V H and V u sequences 
encoding the framework regions of V H and V L and probes for the constant region of the 
entire rearranged immunoglobulin gene, these sequences being obtainable from 
available sources. See, for example, Early and Hood, Genetic Engineering, Setlow and 
Hollaender eds., Vol. 3:157-188, Plenum Publishing Corporation, New York (1981); 
and Kabat et al., Sequences of Immunological Interest, National Institutes of Health, 
Bethesda, Md. (1991). Polynucleotides encoding a polypeptide subunit or component 
of an IgBP can be isolated from either the genomic DNA containing the gene 
expressing the polypeptide or the messenger RNA (mRNA) that codes for the 
polypeptide. The use of mRNA is preferred, due to the difficulty in juxtaposing 
sequences of genomic DNA that encode the polypeptide, where the sequences are 
separated by introns. In such cases, the DNA fragment(s) containing the proper exons 
must be isolated, the introns excised, and the exons spliced together in the proper order 
and orientation. Methods for isolating mRNA coding for peptides or proteins are well 
known in the art. See, for example, Current Protocols in Molecular Biology, Ausubel et 
al., John Wiley and Sons, New York (1987); "Guide to Molecular Cloning Techniques", 
in Methods In Enzymology, Volume 152, Berger and Kimmel, eds. (1987); and 
Molecular Cloning: A Laboratory Manual, Sambrook et al., eds., Cold Spring Harbor 
Laboratory, Cold Spring Harbor, N.Y. (1989). 

Mutagenesis can be. used to generate a multitude of polynucleotides 
encoding different variants of native IgBPs, which can then be resolved by expression 
in an array of eukaryotic cells or plant cells or plants as provided herein. Mutant 
proteins may be obtained in which one or more specific changes in the codons of a gene 
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of choice are introduced. One or a few polypeptides are then expressed for evaluation 
of binding properties. At the other extreme is random mutagenesis, by means of 
relatively nonspecific changes of codons at a variety of sites in the gene of choice. 
Arrays of plants and plant cells, as well as other eukaryotic cells, can be used for 
5 functional screens of mutant IgBPs. Preferably, polynucleotides within such arrays 
encode variants that differ from a native IgBP sequence .in one or more amino acid 
substitutions and/or deletions, such that each variant retains at least 75%, preferably at 

least 95%, identity to the native IgBP. Particularly preferred are arrays in which each 

polynucleotide differs from the native IgBP in one or more point mutations. 

10 Mutagenesis is also a process that occurs naturally during the development of 
antibodies. During the course of B cell maturation, antibody-encoding genes are 
recombined, selected, and mutated to produce plasma cells that encode antibodies with 
higher affinity for an antigen compared to the original polynucleotide encoding the 
antibody with specificity for the same antigen. 

15 Regardless of the precise type of IgBP, array polynucleotides are 

formulated so as to permit entry into, and replication within, a target host cell. .For 
certain transfection techniques, the polynucleotides are cloned into a suitable expression 
vector. Any vector can be used for such transfection, provided that the vector is capable 
of the transcription of IgBP genes as well as selectable markers in a target cell. Typical 

20 expression vectors useful for expression of genes in plants are well known in the art and 
include vectors derived from the tumor-inducing (Ti) plasmid of Agrobacterium 
tumefasciens described by Rogers et al., Meth: in Enzynjdi. 753:253-277, 1987. 
However, several other expression vector systems are known to function in plants. See, 
for example, Register et al., Plant Mol Biol 25:951-961, 1994; Verch et al., J. Immunol 

25 :: . Meth 220:69-75, 1998. 

Alternatively, for certain transfection techniques an expression vector is 
unnecessary or undesirable. In these cases, either single or multiple DNA fragments 
containing the polypeptide coding genes linked to plant expression control elements 
(expression cassettes) are introduced directly into target plant cells. The DNA 
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fragment used to transfect plant cells may include either single or multiple polypeptide 
coding genes. 

Suitable expression vectors and cassettes generally contain expression 
control elements, which include the promoter. The polypeptide coding genes are 
operatively linked to the expression vector or cassette to allow the promoter sequence to 
direct RNA polymerase binding and synthesis of the desired polypeptide coding gene. 
Useful in expressing the polypeptide coding gene are promoters that are inducible, viral, 
synthetic, constitutive, temporally regulated, spatially regulated and/or spatiotemporally 
regulated. The choice of expression vector, and ultimately the selection of a promoter 
to which the polypeptide coding gene is operatively linked, depends directly (as is well 
known in the art) on the functional properties desired (e.g., the location and timing of 
protein expression and the host cell to be transformed). However, an expression vector 
or cassette useful in practicing the present invention is at least capable of directing the 
replication, and preferably also the expression, of the polypeptide coding gene included 
15 in the DNA segment to which it is operatively linked. 

In preferred embodiments, the expression vector or cassette used to 
express the polypeptide-coding genes includes a selection marker that is effective in a 
target cell, preferably a drug resistance selection marker. Alternatively, a separate 
vector or expression cassette comprising a selection marker may be co-transfected. A 
preferred drug resistance marker is the gene whose expression results in kanamycin 
resistance (e.g., the chimeric gene containing the nopaline synthase promoter, Tn5 
neomycin phosphotransferase II and nopaline synthase 3' nontranslated region described 
by Rogers et al., in Methods For Plant Molecular Biology, a Weissbach and H. 
Weissbach, eds., Academic Press Inc., San Diego, Calif. (1988)). Useful plant 
25 expression vectors include pMON 530, pKYLX and pUC. Another preferred drug 
resistance marker is the gene for phosphinothricin acetyltransferase, PAT, which 
confers resistance to Liberty herbicide. Alternatively, the selection marker could permit 
the growth of the transfected cells or tissue on a alternative substrate present in the 
medium (e.g., xylA gene; Haldrup et al., Plant Mol Bio 3 7(2) :2 87-296, 1998). 
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A variety of methods have been developed to operatively link DNAs to 
vectors via complementary cohesive termini. For instance, complementary 
homopolymer tracks can be added to the DNA segment to be inserted and to the vector 
DNA. The vector and DNA segment are then joined by hydrogen bonding between the 
5 complementary homopolymeric tails to form recombinant DNA molecules. 
Alternatively, synthetic linkers containing one or more restriction endonuclease sites 
can be used to join the DNA segment to the expression vector. The synthetic linkers are 
attached to blunt-ended DNA segments by incubating the blunt-ended DNA segments 
with a large excess of synthetic linker molecules in the presence of an enzyme that is 

10 able to catalyze the ligation of blunt-ended DNA molecules, such as bacteria phage T4 
DNA ligase. Thus, the products of the reaction are DNA segments carrying synthetic 
linker sequences at their ends. These DNA segments are then cleaved with the 
appropriate restriction endonuclease and ligated into an expression vector that has been 
cleaved with an enzyme that produces termini compatible with those of the synthetic 

15 linker. Synthetic linkers containing a variety of restriction endonuclease sites are 
commercially available from a number of sources including New England BioLabs, 
Beverly, MA. Polymerase chain reaction (PCR) can also be used to introduce 
appropriate restriction sites into polynucleotide sequences as would be needed for 
ligating the polynucleotide into a restriction site in a plant or eukaryotic cell expression 

20 vector. 

Immunoglobulin Binding Proteins (IgBP) Arrays 

A library of polynucleotides as described above may be used to generate 
an array in eukaryotic cells or organisms (e.g., plants or seeds) using standard 

25 transfection techniques appropriate for the cell or organism of interest. In general, a 
library is used to transfect a population of eukaryotic cells such that some or all of the 
cells contain one or more polynucleotides encoding IgBP polypeptides that are not be 
detectably expressed by untransfected cells. Such transfection can result in functional 
transformation, which permits the cells to produce IgBPs and/or components thereof. 

30 The cells may then be grown on an appropriate medium to allow for replication and the 
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ftinctional expression of IgBPs or components thereof. Optionally, a progeny 
population may be created from a sexual cross of transformants expressing IgBP 
components, such that some or all of the progeny population contain polynucleotides 
encoding sufficient IgBP components to form functional IgBPs. Also optionally, 
5 progeny populations may be created from sexual crosses of transformants expressing 
multiple IgBPs such that each member of the progeny population expresses a single 
functional IgBP. Sexual crosses between plants expressing IgBPs, or between wild- 
type plants and plants expressing IgBPs, results in the genetic segregation of the 
various IgBPs among the various progeny seeds or progeny plants. As such, some of 
1 0 the progeny will express a single IgBP. 

Transformed organisms are generally morphologically normal but for 
the presence of the foreign genes in some or all of their cells. The genes can be present 
in one copy or multiple copies in some or all of the cells. The respective gene products 
can be present in substantially all or a portion of the cells {i.e., the products can be 
localized to a cell type, tissue or organ). Transformed cells may be dormant {e.g., non- 
germinating seeds), present in culture or in an intact organism. 

Transgenic cells or organisms (such as plants) may be produced, in one 
embodiment, by introducing into the nuclear genome a mammalian gene or genes that 
code for a multiplicity of IgBPs. In another embodiment, the introduced gene or genes ■ 
remain extrachromasomal after being introduced by viral infection. In each single 
organism or population of cells, one or more copies of the nucleic acid encoding the 
IgBP polypeptide(s) is integrated into the genome or is resident in the cytoplasm. Since 
a multiplicity of open reading frames is used in the transformation, one or more nucleic 
acids encoding IgBP polypeptides with different specificities can be introduced into 
each cell or population of cells. Transformation techniques that result in multiple 
copies of nucleic acids integrated into the nucleus of a plant cell include Agrobacterium 
mediated transformation, biolistic transformation, electroporation, solid particle 
intrusion, lipofection, chemically-induced DNA uptake, microinjection or 
macroinjection. Viral infection can result in a multiplicity of nucleic acids functionally 
30 operative in the cell cytoplasm. 
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In general, recombinant immunoglobulins can be prepared by isolating 
DNA fragments corresponding to the heavy and light chain variable regions of a 
monoclonal antibody and joining them to each other by any one of the standard 
methods known to those of skill in the art and described by Molecular Cloning: A 
5 Laboratory Manual, Sambrook et al., eds., Cold Spring Harbor Laboratory, Cold Spring 
Harbor, N.Y. (1989). In one preferred embodiment, only DNA fragments corresponding 
to heavy chains are used to prepare suitable vectors for cell transformation. In one 
example of cell transformation, the recombinant DNA fragments can be inserted into 
Agr-obacterium transfer vectors such that the genes of interest are inserted into the 

1 0 Agrobocterhim genome. The recombinant Agrobacterium is then used to infect plant 
cells resulting in the production of the polypeptide of interest In another example of 
cell transformation, the recombinant DNA fragments can be inserted into baculovirus 
transfer vectors such that the genes of interest are inserted into the viral genome in lieu 
of the baculovirus polyhedron gene. The recombinant virus is then used to infect insect 

1 5 cells resulting in the production of the polypeptide of interest. 

An advantage of using insect cells that utilize recombinant baculoviruses 
for the production of IgBPs is that the baculovirus system allows production of mutant 
antibodies as well as combinatorial expression of immunoglobulin with other 
polypeptides (such as J chain) much more rapidly than stably transfected mammalian or 

20 plant cell lines. In addition, insect cells have been shown to correctly process and 
glycosylate eukaryotic proteins. Finally, the baculovirus expression of foreign protein 
has been shown to constitute as much as 50-75% of the total cellular protein late in 
viral infection, making this system an excellent means of producing milligram 
quantities of recombinant immunoglobulins. 

25 . The use of baculovirus Autographica californica nuclear polyhedron 

virus (AcNPV) and recombinant viral stocks in Spodoptera frugiperda (Sf9) ceils to 
prepare large quantities of protein has been described by Smith et al., Proc. Natl Acad. 
Sci. U.S.A. 82: 8404-8408, 1985, and Summers and Smith, Bulletin B - Texas 
Agricultural Experiment Station, May, 1987. A preferred method of preparing 

30 recombinant heavy chains covalently linked to J chain or the light chain constant region 
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is through the expression of DNA encoding recombinant heavy chain, J chain, and light 
chain constant region via the baculoviral expression system in Sf9 insect cells. 

Isolated DNA fragments that encode preferred genes are then inserted 
into the baculovirus transfer vectors. A preferred transfer vector is based on pAc360. 
5 The DNA fragments are digested, purified and ligated into unique restriction sites in the 
vector using standard techniques known to those of skill in the art and described by 
Molecular Cloning: A Laboratory Manual, Sambrook et al., eds., Cold Spring Harbor 
Laboratory, Cold Spring Harbor, N.Y. (1989). 

Recombinant plasmid vectors are then co-transfected with linearized 

10 wild-type AcNPV into Sf9 cells. Cotransfection is preferentially accomplished using 
cationic liposomes, which are commercially available {e.g., from Invitrogen). Sf9 cells 
infected with occlusion-negative, recombinant viruses are then identified and can be 
grown in the desired quantity and under appropriate conditions such that large 
quantities of the desired binding protein are produced. 

15 Methods for introducing polypeptide coding genes into mammalian cells 

have been described and are generally familiar to one skilled in the art. Such methods 
include, but are not limited to, liposome-mediated transformation, calcium phosphate 
transfection, electroporation, diethylamoniethyl dextran carrier and viral infection. It 
will be readily apparent that an optimal method of introducing genes into a particular 

20 eukaryotic cell species may not necessarily be the most effective for another species. 

The advantage of using certain viruses for mammalian cell 
transformation is that a variety of mammalian cell lines are potential targets of viral 
infection. Recombinant Sindbis virus for example, allows rapid, high-level expression 
of heterologous proteins in mammalian cell lines as well as avian, reptilian, mosquito 

25 and Drosophila cells. The virus inhibits host protein synthesis allowing rapid 
identification and purification of the transgenic protein. 

Isolated DNA fragments are introduced into a vector containing an 
appropriate polylinker site downstream from a promoter and upstream from a poly-A 
tail. A useful vector is pSinRepS (Invitrogen). The recombinant vector is then 

30 linearized using appropriate restriction sites downstream from the poly-A tail. RNA 
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transcripts are then produced (using, for example, the SP6 promoter) containing non- 
structural genes for in vivo replication of the recombinant RNA and the promoter and 
transgene of interest. The RNA is then used to transfect mammalian or other cells, such 
as baby hamster kidney (BHK) cells. Protein expression can be assayed 4-72 hours post 
5 infection. Alternatively, viral particles can be harvested and used for infection of 
another cell line. 

In certain embodiments, an array comprises plants or cells transformed 
with polynucleotides encoding immunoglobulin heavy chains and immunoglobulin light 
chains, such that antibody molecules are produced. Within such arrays, individual 

10 immunoglobulin heavy and light chains produced by each plant or plant cell may 
associate with each other and assume a conformation having an antigen binding site 
specific for a -preselected or predetermined antigen, as evidenced by its ability to be 
competitively inhibited. When the binding protein is an antigen binding protein, its 
affinity or avidity is generally greater than 10 5 M' 1 , preferably greater than 10 6 M'\ and 

15 more preferably greater than 10 8 M~\ Immunoglobulins for use in such embodiments 
may generally be derived from the B cells of an immunized host, each of which express 
a different IgBP. 

Similarly, an IgBP array may comprise polynucleotides that encode 
portions of immunoglobulin heavy chains and portions of immunoglobulin light chains. 

20 The individual immunoglobulin heavy and light chain portions in each plant or plant 
cell may associate with each other and assume a conformation having an antigen 
binding site specific for a preselected or predetermined antigen. The antigen binding 
site on a Fab fragment has a binding affinity or avidity similar to the antigen binding 
site on an immunoglobulin molecule. Likewise, the antigen binding site on a SCAB 

25 protein has a binding affinity or avidity similar to the antigen binding site on an 
immunoglobulin molecule. Alternatively, an IgBP array may comprise polynucleotides 
that encode immunoglobulin heavy and light chain fragments that may associate within 
the plant cell to form Fv fragments with a biologically active conformation that has a 
binding site specific for a preselected or predetermined antigen. The antigen binding 
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site on individual Fv fragments has an affinity or avidity for an antigen similar to the 
affinity displayed by the antigen binding site present on an immunoglobulin molecule. 

Within certain preferred embodiments, IgBP arrays (preferably C H BP 
arrays) may be prepared in plants, plant cells and/or seeds. Such arrays may be 
5 prepared in any plant or plant cell that can be conveniently transformed. Suitable plant 
types include, but are not limited to, Arabidopsis, tobacco, Lemna (duckweed), com, 
rice and Chlamydomonas. Techniques for transforming these and other plant types are 
well known in the art, and include Agrobacterium-mediated plant transformation, 
protoplast transformation, gene transfer into pollen, injection into reproductive organs, 
10 gene transfer using a plant cell infective virus or bacterium, mechanical disruption of 
plant cells containing no cell walls in the presence of DNA and solid particulates (e.g., 
glass beads), injection into immature embryos, acceleration into the plant cells on solid 
particles or fibers, or electroporation. As noted above, libraries of polynucleotides 
encoding more than one IgBP are used for transfection. Transfection may result in 
15 introduction of one or more polynucleotides into the nuclear genome of a plant cell. 
Alternatively, the introduced polynucleotide(s) may remain extrachromosomal after 
being introduced by viral infection. Preferably, the transfection technique results in a 
population of cells containing multiple functionally operative polynucleotides encoding 
IgBPs, or components thereof. Such techniques (that result in integration into the. 
20 nuclear genome) include Agrobacterium-mediated transformation, biolistic 
transformation, direct DNA transfer, fiber-mediated transformation, microinjection, 
macroinjection and electroporation. Viral infection can result in multiple 
polynucleotides encoding IgBPs, or components thereof, functionally operative in the 
plant cell cytoplasm. It will be apparent that the selection of an optimal method for 
25 introducing genes will depend, in part, on the particular plant species. 

The use of Agrobacterium-mediated expression vectors to introduce 
DNA into plant cells is well known in the art. See, for example, the methods described 
by Fraley et al., Biotechnology 3: 629, 1985 and Rogers et al., Methods in Enzymology, 
753:253-277, 1987. Further, the integration of the Ti-DNA is a relatively precise 
process resulting in few rearrangements. The region of DNA to be transferred is 
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defined by the border sequences and intervening DNA is usually inserted into the plant, 
genome as described by Spielmann et al., Mol. Gen. Genet 205:34, 1986 and Jorgensen 
etal, Mol. Gen. Genet. 207:471, 1987. Modern Agrobacterium transformation vectors 
are capable of replication in Escherichia coli as well as Agrobacterium, allowing for 
5 convenient manipulations, as described by Klee et al., in Plant DNA Infectious Agents, 
T. Hohn and J. Schell, eds., Springer-Verlag, New York (1985) pp. 179-203. Further 
recent technological advances in vectors for Agrobacterium-mediated gene transfer 
have improved the arrangement of genes and restriction sites in the vectors to facilitate 
construction of vectors capable of expressing various polypeptide coding genes. The 
10 vectors described by Rogers et ah, Methods in Enzymology 153:253, 1987, for example, 
have convenient multi-linker regions flanked by a promoter and a polyadenylation site 
for direct expression of inserted polypeptide coding genes, and are suitable for present 
purposes. 

In those plant species where Agrobacterium-mediated transformation is 

15 efficient, this is the method of choice because of the facile and defined nature of the 
gene transfer (see Horsch et al., Science 227:1 129-1231, 1985; Feldmann and Marks, 
Mol. Gen. Genet. 208:1-9, 1987; Chang et al., Plant J. 5:551-558, 1994; Bechtold et al., 
Acad. Set Paris Science de la Vie 3 J 6:1 194-1 199, 1993; Hansen and Chilton, Curr Top 
Microbiol Immunol 240:21-51, 1999). Few monocots appear to be natural hosts for 

20 Agrobacterium. However, transgenic plants may be produced in a variety of monocots 
using Agrobacterium vectors (as described, for example, by Bytebier et al., Proc. Natl. 
Acad. Sci. U.S.A. 84:5345, 1987; Hiei et al., Plant J. (5(2) :27 1-282, 1994; Ishida et al., 
Nat. BiotechnoL 1 4 (6) :7 45-7 50, 1996; Hansen and Chilton, Curr Top Microbiol 
Immunol 240:21-57, 1999). Alternatively, vector-free or direct DNA transfer methods 

25 have been developed to transform a variety of plant species. As an example, 
transformation of plant protoplasts can be achieved using methods based on calcium 
phosphate precipitation, polyethylene glycol treatment, electroporation, microinjection, 
and combinations of these treatments. See, for example, Potrykus et al., Mol. Gen. 
Genet. 199:183, 1985; Lorz et al., Mol Gen. Genet. 199: 178, 1985; Fromm et al., 

30 Nature 319:791, 1986; Uchimiya et al., Mol. Gen. Genet. 204:204, 1986; Callis et al., 
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Genes and Development 7:1183, 1987; Marcotte et aL, Nature 335:454, 1988; Roest 
and Gilissen, Acta Bot Need 5*1-23, 1989; Davey et aL, Plant Mol Biol 73:273-285, 
1989; Shimamoto et aL, Nature 335:274-276, 1989; and Datta et al., Bio/Technology 
5:736-740, 1990. Silicon carbide-mediated transfonnation may be used to generate 
stably transformed, fertile maize plants (Frame, et aL, The Plant Journal for Cell and 
Molecular Biology 6:941-948, 1994) as well as tobacco cell cultures (Kaeppler, et al., 
Theoretical and Applied Genetics 84:560-566, 1992), Agrostis alba cell cultures (AsanJ 
et al., Plant Cell Reports 73:243-246, 1994), and Chlamydomonas reinhardtii 
(Dunahay, BioTechniques 75:452-460, 1993). Application of these systems to 
different plant species often depends upon the ability to regenerate that particular plant 
species from protoplasts or callus. Illustrative methods for the regeneration of cereals 
from protoplasts are described, for example, in Fujimura et al., Plant Tissue Culture 
Letters 2:74, 1985; Toriyama et al., Theor Appl. Genet. 73:16, 1986; Yamada et al., 
Plant Cell Rep. 4:85, 1986; Abdullah et al., Biotechnology 4:1087, 1986. Register etal.,' 
Plant Mol. Bio. 25(6):95\-96\, 1994; Blackhall N.W. et al., "Callus Initiation, 
Maintenance, and Shoot Induction in Rice," in Plant Cell Culture Protocols, Hall R.D. 
(editor), Humana Press (Totowa, NJ), 1999, pgs. 19-30. 

To transform plant species that cannot be successfully regenerated from 
protoplast or callus, other ways to introduce DNA into intact cells or tissues can be 
used. For example, regeneration of cereals from immature embryos or explants can be 
effected as described by Dasil, Biotechnology 6:397, 1988. In addition, "particle gun" 
or high-velocity microprojectile technology can be used. Using such technology, DNA 
is carried through the cell wall and into the cytoplasm on the surface of small (0.525 
Hm) metal particles that have been accelerated to speeds of one to several hundred 
meters per second (as described in Klein et al., Nature 327:70, 1987; Klein et al., Proc. 
Natl. Acad. Sci. U.S.A. 55:8502, 1988; and McCabe et al., Biotechnology 6:923, 1988). 
The metal particles penetrate through several layers of cells and thus allow the 
transformation of cells within tissue explants. Metal particles may be used to 
successfully transform corn cells and to produce fertile, stably transformed plants in a 
variety of species including model plants such as tobacco (Klein et al., 1988), as well as 
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important crop species such as soybean (McCabe et al., 1988), maize (Fromm et al., 
Bio/technology 8: 833-839,1990; Gordon-Kamm et al, The Plant Cell 2: 603-618 
1990), rice (Christou et al., Bio/technology 9: 957-962, 1991), barley (Wan and 
Lemaux, Plant Physiology 104: 37-48, 1994) and wheat (Vasil et al., Bio/technology 
5 10: 662-674, 1992; Weeks et al., Plant Physiology 102: 1077-1084, 1993). 
Transformation of tissue explants eliminates the need for passage through a protoplast 
stage and thus speeds the production of transgenic plants. 

DNA can also be introduced into plants by direct DNA transfer into 
pollen, as described by Zhou et al., Methods in Enzymology 1 01:433, 1983; Hess, 

10 Intern Rev. Cytol 107:361, 1987; Luo et al., Plant Mol Biol Reporter 5:165, 1988; and 
Saunders et al., Molecular Biotechnology 5:181-190, 1995. Expression of polypeptide 
coding genes can be obtained by injection of the DNA into reproductive organs of a 
plant as described by Pena et al., Nature 325:274, 1987. DNA can also be injected 
directly into the cells of immature embryos, followed by the rehydration of desiccated 

15 embryos as described by Neuhaus et al., Theor. Appl Genet. 75:30, 1987; and 
Benbrook et al., in Proceedings Bio. Expo. 1986, Butterworth, Stoneham, Mass., pp. 
27-54(1986). 

Regardless of the method of transformation, transformed plant cells are 
generally grown on a suitable medium. Cells may be grown on medium that does not 

20 promote differentiation (e.g., as calli), or may be grown such that plants are 
regenerated. For example, some lower plant species, such as Chlamydomonas, do not 
require regeneration and can express immunoglobulin binding proteins immediately 
after selection for the expression of a selectable marker, as a result of growth in an 
appropriate medium. Suitable media for a wide variety of plant cell types are well 

25 known to those of ordinary skill in the art. In general, transformed cells are selected 
based on expression of the selection marker, such that only those cells that contain at 
least one transfected polynucleotide encoding an immunoglobulin binding protein are 
permitted to grow. 

Non-combinatorial arrays expressing heavy chain binding proteins may 

30 be generated using, for example, the MaxBac Baculovirus Expression System 
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(Invitrogen). Briefly, insect cells are co-transfected with recombinant plasmid or 
plasmids and linearized Bac-and-Blue DNA. A viral supernatant fluid is harvested after 
the appropriate infection time and the vims is used to display recombinant plaques on 
agar plates using techniques recommended by the manufacturer. Individual plaques can 
then by used for the evaluation of binding protein expression. Alternatively, heavy 
chain binding protein arrays may be generated using the Sindbis Expression System 
(Invitrogen), allowing the expression of polypeptides in baby hamster kidney (BHK) 
cells and the production of viral particles that can be used to infect a variety of other 
eukaryotic cell lines. 

The regeneration of plants from either single plant protoplasts or various 
explants is well known in the art. See, for example, Methods for Plant Molecular 
Biology, A. Weissbach and H. Weissbach, eds., Academic Press, Inc. (San Diego, CA; 
1988). This regeneration and growth process includes the steps of selection of 
transformant cells and shoots, rooting the transformant shoots and growth of the 
15 plantlets in soil. 

The regeneration of plants containing the foreign gene introduced by 
Agrobacterium tumefasciens from leaf explants can be achieved as described by Horsch 
et al., Science 227:1229-1231, 1985. In this procedure, transformants are grown in the 
presence of a selection agent and in a medium that induces the regeneration of shoots in 
the plant species being transformed, as described by Fraley et al., Proc. Natl. Acad Sci. 
U.S.A. ,50:4803, 1983. This procedure typically produces shoots within two to four 
weeks, and these transformant shoots are then transferred to an appropriate root- 
inducing medium containing the selective agent and an antibiotic to prevent bacterial 
growth. Transformant shoots that rooted in the presence of the selective agent to form 
plantlets are then transplanted to soil to allow the production of roots. These 
procedures will vary depending upon the particular plant species employed, such 
variations being well known in the art. 

An array may consist of a set of seeds generated from transformed 
plants. While such seeds have the potential to germinate into plants that express IgBPs, 
expression does not generally occur when the seeds are dormant (although some' 
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expression may occur during seed development). Nonetheless, such a set of seeds is 
considered an array. 

Within arrays, the IgBPs or components thereof may be secreted from 
cells or may accumulate in one or more intracellular compartments. Secreted proteins 
5 may or may not be contained by a cell wall. For example, the CW15 mutant of 
Chlamydomonas does not express a cell wall and IgBPs containing leader sequences are 
secreted directly into the growth medium. In this instance, the desired IgBP can be 
identified by assaying a portion of the growth medium for the desired characteristic. In 
higher plants, secreted IgBPs are generally contained by the cell wall and may 

10 accumulate in the apoplastic water. A desired IgBP can be assayed after disruption of 
the cell wall using any of a variety of standard mechanical techniques {e.g., mortar and 
pestle homogenization). 

Within certain embodiments, an array of IgBPs in plant cells is derived 
from a progeny population of plant cells {i.e., plant cells resulting from the sexual cross 

15 of transformants expressing IgBP components). In such an array, some or all of the 
progeny population contain polynucleotides encoding IgBP components, such that 
functional IgBPs are expressed in the array. In such arrays, useful IgBP component 
polynucleotides include those that encode a second polypeptide that can autogenously 
associate with a first polypeptide in such a way as to form a biologically functional 

20 IgBP. Examples of such IgBP components are heavy and light chains of antibodies. 

The sexual cross of different members of a plant species has been well 
described by Mendel in 1865 (an English translation of Mendel's original paper 
together with comments and a bibliography of Mendel by others can be found in 
Experiments In Plant Hybridization, Edinburgh, Scotland, Oliver Boyd, eds., 1965). 

25 When the plants are flowering plants, the sexual cross involves contacting viable pollen 
from one population of plants with the stigma of another population of plants of a 
sexually compatible species. When the plant cells are photosynthetic unicellular 
organisms {e.g., Chlamydomonas), the sexual cross involves fusion of cells of + and — 
mating types, followed by meiosis and tetrad formation. Progeny from sexual crosses 

30 can comprise seeds or tetrads. 



n-ift-wifiAi i -> 



WO 01/83806 



PCT/USOl/14349 



41 

Progeny resulting from a sexual cross of transformed plant cells may, 
within certain embodiments., be resolved into an array in which some or all of the 
progeny contain polynucleotides encoding a single IgBP. Briefly, transformed plant 
cells expressing polynucleotides encoding multiple IgBPs can be sexually crossed to 
5 produce a progeny population. For higher plants, the sexual cross involves pollination 
as described above. The source of pollen can be either the same plant (self pollination) 
or a different plant (cross pollination). In some cases pollination is not required for 
seed development. The resulting progeny population contains the genetic segregation 
events of the polynucleotides encoding various IgBPs or components. Some of the 
1 0 progeny may contain polynucleotides encoding a single IgBP. 

For haploid organisms such as Chlamydomonas, resolution of multiple 
binding events may also be accomplished by molecular genetic techniques. Nucleic 
acids encoding IgBPs can be identified by Southern blotting and isolated by PCR using 
known primers. The isolated nucleic acids can then be used for re-transformation of the 
haploid plant cell to produce a secondary array representing the resolved nucleic acids 
encoding IgBPs. In this process, vector DNA encoding IgBPs is amplified using 
oligonucleotides complementary to the boundary regions of the expression cassette. 
The amplified DNA is ideally suitable for ligation into an expression vector for re- 
transformations of a new population of Chlamydomonas cells. Transformants from this 
population contain relatively few different IgBPs and therefore there is a higher 
probability of generating a population of transformed cells wherein each nucleic acid is 
expressed uniquely by an individual transformed cell. 
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Immunoglobulin Binding Protein Array Compositions 

The present invention also provides compositions that comprise an array 
of encapsulated IgBPs. IgBPs may be encapsulated, for example, within plant cells, 
plant cell walls, enteric coatings, coatings and the like. Particularly preferred are' 
compositions containing no more than 10,000 grams of plant material for each 100 
nanograms of IgBP. Such compositions typically contain at least 100 nanograms of 
plant material for each 10 grams of IgBP. In more preferred embodiments, a 
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composition comprises no more than 10,000 grams of plant material for each milligram 
of IgBP, but at least 100 nanograms of plant material for each gram of IgBP and more 
preferably at least one milligram of plant material for each 500 milligrams of IgBP. 

A composition may further comprise substances such as chlorophyll, 
5 synergistic compounds, medicines, compounds derived from medicinal plants and/or 
various pharmaceuticals. 

Assays for Screening Arrays of Immunoglobulin Binding Proteins 

Transgenic arrays as provided herein are useful for the discovery of 

10 IgBPs having desired characteristics. From the transgenic arrays provided herein, 
individual eukaryotic organisms or clones of eukaryotic cells can be identified rapidly, 
enabling the easy access to an economical, high yield process for the large scale 
production of a desired IgBP. 

As noted above, plants are a particularly preferred host organism for the 

15 transgenic arrays described herein. When a sexual cross is performed between two 
populations of plant cells, each expressing IgBPs or components of IgBPs, some or all 
of the progeny can express functional IgBPs that comprise multiple IgBP components. 
Such progeny may be of particular use within functional screens for IgBPs having 
multiple polypeptide subunits. 

20 More specifically, once regenerated plants are adapted to growth in soil, 

small sections of leaf can be removed to assay for the presence of IgBPs. Alternatively, 
the plants can be allowed to set seed by either self-pollination or cross pollination and 
the assays for expression of IgBPs can be performed on individual seeds or portions of 
seeds. When the transformants are individual cells (e.g., Chlamydomonas) the cells 

25 may be grown in solid or liquid medium to obtain a sufficient quantity for performing 
assays. Assays can be immunological (e.g., ELISA) or functional. Functional assays 
may involve, for example, binding of the IgBP to cognate factors immobilized on a 
solid support. When the IgBP is an antibody, the cognate factor is an antigen. Assays 
for determining the number of copies of IgBP as well as the number of different IgBPs 

30 being expressed in the primary transformants include Southern blotting and DNA 



WO 01/83806 



PCT/US01/14349 



43 

sequencing of cloned cDNAs derived from cellular RNA. Description of these assays 
may be found in Antibodies: A Laboratory Manual, Harlow and Lane, eds., Cold 
Spring Harbor, N.Y. (1988) and Molecular Cloning: A Laboratory Manual, Sambrook 
et al., eds., Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. (1989). 

The following Examples are offered by way of illustration and not by 
way of limitation. 
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EXAMPLES 
Example 1 

Isolation and Expression in Maize of Immunoglobulin Heavy Chain-Coding Genes And 
5 Immunoglobulin Light Chain-Coding Genes Derived From B Cells in Bone Marrow 
This Example illustrates the preparation of representative 
immunoglobulin binding protein components for use in preparing an array of 
immunoglobulin binding proteins in plant cells. 

Bone marrow (~10 7 cells) was obtained from donors at local hospitals. 
10 Donors were chosen on the basis of (a) having had Clostridium difficile-associated 
diarrhea within the previous two months and (b) having toxin neutralizing antibodies in 
their serum. Bone marrow was obtained from the subset of this population whose 
diarrhea resolved without the intervention of any further antibiotic therapy. 

Bone marrow cells were washed multiple times in cold PBS then 
15 incubated with combined FITC-labeled Clostridium difficile toxin A and toxin B. This 
procedure was used to identify B cells expressing anti-toxin A or anti-toxin B on their 
cell surfaces. The Clostridium difficile toxins were prepared from cultures of 
Clostridium difficile as described by Lyerly et al., Infect Immun 35:1 147-50, 1982. 

Fluorescent labeling of the toxins was by reaction with fluorescein 
20 isothiocyanate (FITC) using procedures provided by the manufacturer (Pierce). 
Incubation of bone marrow cells with the combined FITC-labeled Clostridium difficile 
toxin A and toxin B was by the procedures described in Hoven et al., J. Immunol. 
Methods 777:2275-84, 1989. The tagged cells were shown to be B cells by double 
staining with phycoerythrin-conjugated anti-human Ig. 
25 Total RNA from the selected bone marrow cells was prepared using the 

QIAamp RNA Blood Mini-Kit system (Qiagen) according to the instructions provided. 

PCR amplification of the isolated RNA employed oligonucleotides 
complementary to conserved 3' and 5' regions of immunoglobulin (IgGl) transcripts as 
previously described in Antibody Engineering, Carl A, Borrebaeck, Ed., W.H. Freeman 
30 and Company, New York, N.Y., 1995. The oligonucleotide primers introduced unique 
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restriction sites at each end of the amplified polynucleotide to allow for ligation into a 
plant expression vector. The amplified portion of the immunoglobulin polynucleotides 
extended from the 5' end encoding the respective FR1 regions to the 3' end encoding 
the stop codon. In addition, appropriate 5' and 3' restriction sites were introduced in 
5 the process of amplification. The amplified heavy and light polynucleotide chains 
therefore corresponded to the first six codons at the 5' end of the RNAs and the last six 
codons of the constant regions with the restriction sites introduced outside of the 
respective terminal codons. This immunoglobulin binding protein array consisting of 
double stranded amplified cDNAs from the sorted cells was prepared for insertion into 

10 a plant expression vector, by converting the ends of the double stranded cDNA to the 
appropriate sticky ends for ligation into the vector (Molecular Cloning: A Laboratory 
Manual, Sambrook et al. 5 eds., Cold Spring Harbor Laboratory, Cold Spring Harbor, 
N.Y. 1989) by digestion with a restriction endonuclease according to the 
manufacturer's protocol (New England BioLabs). 

15 The vector allowed for the expression of both heavy and light chains 

using the ubiquitin promoter and nopaline synthase terminator as described by Ishida et 
al., Nature Biotechnology 14:745-750, 1996 and Register et al., Plant Mol Biol 25:951- 
961, 1994. 

The vector further contained polynucleotides encoding signal sequences 
20 (MDWTGRFLFVVAAATGVQS; SEQ ID NO: 73) upstream from each site for 
insertion of the amplified immunoglobulins. The signal sequence was derived from the 
Kabat database (Kabat et al., Sequences of Immunological Interest, National Institutes 
of Health, Bethesda, Md., 1991) to allow for association of both heavy and light chain 
nascent polypeptides with the endomembrane system of the plant cell. In addition, the 
25 vector contained a multiple cloning site immediately downstream from the signal 
sequence encoding regions. The amplified regions were then ligated into the vector to 
encode a heavy chain polynucleotide consisting of a signal sequence, a diversity of 
variable region and the entire gamma constant region; and a light chain polynucleotide 
consisting of a signal sequence, a diversity of light chain variable region and the entire 
30 kappa constant region. 
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The recombinant population of vectors was then used to transform 
Agrobacterium tumefasciens as described by Ishida et al., Nature Biotechnology 
74:745-75, 1996. The transformed Agrobacterhim was then used to infect immature 
com embryos followed by selection and regeneration of maize plants as described in 
5 Ishida et al. The resulting array of plant cells producing IgBP polypeptides {i.e., IgBPs 
and/or IgBP components) comprised approximately 1 000 plants. 

To locate the position in the array of various IgBP polypeptides, samples 
of leaf tissue were taken from each plant. A variety of assays were performed to 
characterize the presence of IgBPs, the functionality of IgBPs, and the variety of IgBP 

10 components in each transformant. IgBP components (gamma chains or kappa chains) 
were detected by ELISA in 96 well plates. In one set of plates the capture of IgBP 
components was by reaction with immobilized goat anti-human kappa chain antibody 
and the detection was by horse radish peroxidase conjugated goat anti-human kappa 
chain antibody. In another set of plates the capture of IgBP components was by 

15 reaction with immobilized goat anti-human gamma chain antibody and the detection 
was by horse radish peroxidase conjugated goat anti-human gamma chain antibody. In 
a third set of plates the capture of IgBP components was by reaction with immobilized 
goat anti-human gamma chain antibody and the detection was by horseradish 
peroxidase-conjugated goat anti-human kappa chain antibody. Detection of captured 

20 IgBP components was by the procedures previously described (Antibodies: A 
Laboratory Manual, Harlow and Lane, eds., Cold Spring Harbor, N.Y., 1988). The 
results demonstrated that the majority of plants co-expressed gamma and kappa chains. 

The leaf material was further evaluated for the ability of the expressed 
IgBPs to bind the toxin A or toxin B proteins. In these assays, ELISA plates were 

25 coated with either toxin A or B to capture the IgBPs. The ELISA procedures generally 
followed those previously described (Hiatt et al., Nature 342:16-1%, 1989). Leaf 
extracts were incubated in the ELISA wells to allow reaction of potential 
immunoglobulin binding proteins with the immobilized toxins. Screening the entire 
array of approximately 1000 plants as described above identified 10 plants expressing 

30 functional anti-toxin A antibodies and 5 plants expressing functional anti-toxin B 
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antibodies. From these results, it appears that greater than 20% of the total 
polynucleotide population encoded anti-toxin A heavy or light chains. Similarly, 
greater than 14% of the total population encoded anti-toxin B heavy or light chains.' 
The remaining 66% of the polynucleotides derived from the sorted cells may have 
encoded unrelated immunoglobulins. 

To assess the variety of immunoglobulin binding protein components 
contained in those plants that expressed functional anti-toxins, PGR amplification of 
gamma or kappa transcripts were cloned into bacterial transformation vectors as 
described (Antibody Engineering, Carl A, Borrebaeck, Ed., W.H. Freeman and 
Company, New York, NY, 1995). Plasmids derived from each of the bacterial colonies 
were then subjected to automated DNA sequencing as described (DNA Sequencing: 
From Experimental Methods to Bioinformatics, L. Alphey, Bios Scientific Publishers, 
Oxford, England, 1997). The sequences derived from various plants revealed that in 
some cases more than one set of IgBP components was expressed in a single 
15 transformant. 

To segregate the different polynucleotides encoding different IgBPs into 
different progeny plants, the transforms expressing multiple IgBP polypeptides were 
self-pollinated to produce seed sets. The resulting array of seeds was grown into 
individual plantlets, at which time portions of the leaf were removed to assay for the 
presence of functional IgBPs as described above. Since the heavy and light chains were 
co-expressed on the same vector, the genetic segregation potentially identifies 
functional anti-toxin heavy and light chains expressed from the same vector. This is 
useful since the parental plants expressing anti-toxins could have formed antibody from 
heavy and light chains polynucleotides on different vectors co-integrated into the same 
nucleus during Agrobacterium transformation. Of the 15 plants expressing anti-toxin 
antibodies, 14 produced a progeny array where some members of the array expressed 
anti-toxin antibodies indicating co-expression of heavy and light chains from the same 
vector. The single parental plant that produced progeny containing no anti-toxin 
antibodies presumably expressed anti-toxin heavy and light chain polynucleotides from 
different vectors, which co-integrated during transformation but which segregated in 
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the progeny. This observation is consistent with the known ability of Agrobacterium to 
introduce more than one vector into a nucleus during transformation. 

Example 2 

5 Isolation and Expression in Maize of Immunoglobulin Heavy Chain-Coding Genes And 

J Chain-Coding Gene 
This Example illustrates the preparation of representative 
immunoglobulin heavy chain binding protein components for use in generating an array 
of immunoglobulin heavy chain binding proteins in plant cells. 

10 RNA was prepared as described in Example 1 . PCR amplification of the 

isolated RNA employed oligonucleotides complementary to conserved 3' and 5' 
regions of the variable region of the immunoglobulin heavy chain (IgGl) transcripts as 
previously described in Antibody Engineering, Carl A, Borrebaeck, Ed., W.H. Freeman 
and Company, New York, N.Y., 1995. The oligonucleotide primers introduced unique 

15 restriction sites at each end of the amplified polynucleotide to allow for ligation into a 
plant expression vector. The amplified portion of the immunoglobulin polynucleotides 
extended from the 5' end encoding the respective FR1 regions to the 3 5 end encoding 
the first six amino acids of the CHI region. In addition, appropriate 5' and 3' 
restriction sites were introduced in the process of amplification. The amplified heavy 

20 chain polynucleotide chains therefore corresponded to the first six codons at the 5' end 
of the RNAs and the first six codons of the constant regions with the restriction sites 
introduced outside of the respective terminal codons. This immunoglobulin heavy 
chain binding protein array consisting of double stranded amplified cDNAs from the 
sorted cells was prepared for insertion into a plant expression vector, by converting the 

25 ends of the double stranded cDNA to the appropriate sticky ends for ligation into the 
vector (Molecular Cloning: A Laboratory Manual, Sambrook et al., eds., Cold Spring 
Harbor Laboratory, Cold Spring Harbor, NY 1989) by digestion with a restriction 
endonuclease according to the manufacturer's protocol (New England BioLabs). 
Synthetic polynucleotides encoding J chain were additionally prepared for ligation into 

30 the plant vector by PCR amplification as above. Finally, the constant region of an IgG- 
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A hybrid heavy chain (Ma et al., Science 268:716-719) was amplified by PCR to 
contain appropriate restriction sites for ligation into the plant vector downstream from 
the variable region amplified above. 

As in Example 1, the vector allowed for the expression of two open 
reading frames using two ubiquitin promoters and nopaline synthase terminator as 
described by Ishida et al., Nature Biotechnology 7*745-750, 1996 and Register et al., 
Plant Mol Biol 25:951-961, 1994. 

The vector rurther contained a polynucleotide encoding signal sequences 
(MDWTGRFLFVVAAATGVQS; SEQ ID NO: 73) upstream from the site for insertion 
of amplified DNAs and downstream from the first promoter. The signal sequence was 
derived from the Kabat database (Kabat et al., Sequences of Immunological Interest, 
National Institutes of Health, Bethesda, Md., 1991) to allow for association of heavy 
chain nascent polypeptides with the endomembrane system of the plant cell. In 
addition, the vector contained a multiple cloning site immediately downstream from the 
signal sequence codons at the first promoter site and immediately downstream from the 
second promoter site. The amplified heavy chain constant region was first ligated into 
the vector downstream from the signal sequence at the first promoter site. The 
remaining restriction sites located between the signal sequence and the heavy chain 
constant region were used for ligation of the amplified heavy chain variable regions 
prepared above. The amplified J chain (containing a signal sequence) was ligated 
downstream from the second promoter. The resulting vector encoded a heavy chain 
polynucleotide consisting of a signal sequence, a diversity of variable regions and the 
entire gamma-alpha hybrid constant region (Ma et al., Science 268:716-719); and a J 
chain polynucleotide containing a signal sequence. 

The recombinant population of vectors was then used to transform 
Agrobacterium tumefasciens as described above. Regenerated leaf material was 
evaluated as described above with the exception that in one set of ELISA plates both 
the capture and detection antibody recognized human alpha constant regions. In 
another set of ELISA plates capture was with anti-alpha antibodies and detection was 
with anti-J chain antibodies. Evaluation of binding to toxin A or toxin B was performed 
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as described above where leaf extracts were incubated in the ELISA wells to allow 
reaction of potential heavy chain binding proteins with the immobilized toxins. 
Screening the entire array of approximately 1000 plants as described above identified 
50 plants expressing functional binders of toxin A and 40 plants expressing functional 
5 binders of toxin B. From these results, it appears that the non-combinatorial approach 
to isolating binding proteins containing no light chain variable regions yields a higher 
proportion of transformation events expressing binding proteins compared to the 
combinatorial approach involving the random pairing of heavy and light chains. The 
ability to detect these binding proteins appears to be due to their assembly into a 
1 0 polymeric molecular structure with enhanced avidity for the toxins. 

Example 3 

Preparation of an Array of Functional Immunoglobulin BindinR Proteins from Binding 
Protein Components by a Sexual Cross of Higher Plants 

15 This Example illustrates the generation of an array of plants that express 

functional immunoglobulin binding proteins from plant populations that express 
different immunoglobulin binding protein components. 

cDNAs encoding heavy and light chains derived from the B cell 
populations described above were cloned separately into vectors allowing for 

20 expression of heavy chains and light chains in different plant populations. The 
procedures described above were performed to generate plant transformants expressing 
arrays of heavy chains or light chains (immunoglobulin binding protein components). 
In order to form an array of immunoglobulin binding proteins, pollen from the light 
chain transformants was used to pollinate the heavy chain transformants to produce a 

25 seed set. The seeds were then grown into individual plantlets and subjected to the 
analysis for identification of immunoglobulin binding proteins described above. 
Results similar to Example 1 were obtained indicating that the random assortment of 
heavy and light chains by cloning them into different vectors followed by assembly 
upon crossing plants is comparable to cloning heavy and light chains into the same 

30 vector followed by assembly in primary transformants. 
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Example 4 

Immunoglobulin Binding Protein Array in Chlamvdomonas reinhardtii 

This Example illustrates the preparation of an immunoglobulin binding 
protein array in Chlamydomonas reinhardtii. 

A. Transformation techniques . Cells were transformed using the glass 
bead protocol as previously described (Kindle, Proc. Natl. Acad. Sci. USA 57:1228^ 
1232, 1990). The transformation procedure described below uses the argininosuccinate 
lyase (ARG7, Debuchy et al., EMBO J. 5:2803-2809, 1989) as the selectable marker 
gene and transformants are selected as arginine prototrophs on medium not 
supplemented with arginine. Cells (arg, CW-15 strain, lacking a cell wall) were first 
grown under constant high light to mid-log phase in non-selective medium. About 5 x 
10 7 cells were resuspended in 300 ul of medium, 150 ul 20% PEG-8000 (5% final 
concentration) and 0.3 g of acid-washed, heat-sterilized glass beads were then added. 
15 Ten ug of test plasmid DNA, along with 10 ug of pARG7.8 DNA (plasmid DNA 
containing ARG7), was added to the cells and the mixture was then agitated for 10 
seconds with a vortex mixer at the highest setting. Alternatively, 5 mg of heat-sterilized 
silicon carbide whiskers was used and vortexed for 90 seconds (Dunahay, 
BioTechniques 75:452-460, 1993). The transformation mixture was then spread onto 
20 selective agar plates. Plates were placed under light and colonies were visible to the 
naked eyes after 7-10 days. All plasmid DNA used was linearized prior, to use. A 
transformation with pARG7.8 DNA alone was always included as a control. 

B. Expression of an Immunoglobulin Binding Protein Array . The IgG 
array isolated by PCR from the B cells selected above (anti-toxin A and B) were 
25 introduced into bacterial vectors (pBluescript). These vectors were used to transform 
bacteria and generate a library of polynucleotides encoding immunoglobulin binding 
protein components (gamma and kappa chains). The sequences of the variable regions 
of 100 polynucleotides (50 gamma chains and 50 kappa chains) was determined by 
automated sequence analysis. Variable regions of each of these polynucleotides were 
30 synthesized to reflect the codon bias of Chlamydomonas. The synthesis, strategy 
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introduced unique Sac and Xho restriction sites at the ends of each DNA. Likewise, a 
single kappa constant region and gamma constant region (sequence derived from Kabat 
et al.) were synthesized with Chlamydomonas codon bias. The synthetic kappa and 
gamma constant region DNAs were first restricted and ligated into separate Xho-EcoRl 
5 restricted pBluescript vectors to create separate vectors containing synthetic 
immunoglobulin constant region polynucleotides. These vectors were cloned and 
purified and were then used as the recipients for Sac-Xho restricted synthetic variable 
region DNAs to create an array of separate gamma and kappa polynucleotides encoding 
the diverse variable regions and constant regions of full length gamma and kappa- 

10 encoding polynucleotides. These vectors were used to transform bacteria from which 
plasmids were purified and automated sequence analysis was used to identify the 
original 100. variable regions comprising the arrays. These plasmids were mixed and 
the Sac-EcoRl restriction fragments were subcloned into the pARG7 vector. The 
pARG7 vector contained a signal sequence derived from the Kabat database 

15 (MDWTGRFLFVVAAATGVQS; SEQ ID NO: 73) upstream from Sac-EcoRl sites. 
The ligation product created full length heavy and light chain polynucleotides capable 
of co-expression with a signal sequence to direct polypeptide synthesis to the 
endomembrane system of the cell. Colonies on agar plates were tested for expression of 
antibodies using a nitrocellulose lift assay. Nitrocellulose circles were placed on the 

20 agar plates containing colonies on order to adsorb sufficient antibody onto the paper to 
allow for functional detection. 



Example 5 

Expression of Heavy Chain Binding Protein Genes in Insect Cells 

25 

This example illustrates the generation of an array of insect cells that 
express functional heavy chain binding proteins. 

The heavy chain and J chain PCR products derived from B cells as 
described above were additionally manipulated to allow for expression in insect cells. 
30 In general, the double stranded cDNA from the selected B cells described above was 
prepared for insertion into an insect expression vector, by converting the ends of the 
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double stranded cDNA to the appropriate sticky ends for ligation into the vector 
(Molecular Cloning: A Laboratory Manual, Sambrook et al., eds., Cold Spring Harbor 
Laboratory, Cold Spring Harbor, N.Y. 1989) by digestion with a restriction 
endonuclease according to the manufacturer's protocol (New England Biolabs). The 
vector (p2Bac, Invitrogen) allowed for the expression of both heavy and J chains using 
the P PI0 and the P PH promoters respectively and appropriate polyadenylation sequences 
(BGH and PH respectively, Invitrogen). The vector therefore was potentially capable of 
co-expression of both a heavy chain and a J chain. The vector further contained a 
polynucleotide encoding a signal sequence derived from the Kabat database (Kabat et 
al., Sequences of Immunological Interest, National Institutes of Health, Bethesda, 
Md.,1991) to allow for association of both heavy and J chain nascent polypeptides wuh 
the endomembrane system of the insect cell. The signal sequence was located 
immediately upstream from the first polylinker which contained convenient restriction 
sites for introduction of additional heavy chain elements. Downstream from the first 
polylinker, polynucleotides encoding the constant region of the human mu heavy chain 
were inserted. The mu heavy chain region obtained by amplification by PCR, 
corresponded to codons encoding the last five amino acids and stop codon at the 3' end 
of the RNA, as well as the first six codons of the constant region. The heavy chain V 
region DNA, encoding the heavy chain portion of the combining sites recognizing 
toxins A or B, was ligated into the first polylinker sight after introduction of the signal 
and constant region elements. Polynucleotides encoding the entire J chain including 
signal sequence containing appropriate restricted 3' and 5' ends to allow ligation into 
the second polylinker site were introduced into the vector last. The vector consequently 
encoded a heavy chain polynucleotide consisting of a signal sequence, a variable region 
and an mu constant region; and a J chain polynucleotide consisting of a signal sequence- 
and the entire coding region of a human J chain. An additional vector was constructed 
which encoded the constant region of the human kappa light chain downstream from a 
unique promoter (P PH in pMelBac A, Invitrogen) and an insect signal sequence. 

The resulting recombinant vectors as well as linearized AcNPV BV 
DNA (Pharmingen) were used to co-infect Sf insect cells using procedures 
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recommended by the manufacturer to generate an array of viral plaques on agar plates. 
Each plaque potentially contained the virions encoding unique heavy chain binding 
proteins as well as the unique heavy chain binding proteins themselves. Virus particles 
from approximately 1000 of the plaques were used to infect High Five™ insect cells 
5 (Invitrogen) in an array of 96 well plates (~10 6 cells per well). 72 hours post-infection 
cells and supernatant were harvested, brought to 1 mL total volume, and 10-100 jaL 
were analyzed for composition and functionality. Identification of desired heavy chain 
and binding functionality can utilize any of a variety of techniques known to those of 
skill in the art such as ELISA, immunoblotting, Western blotting, immunoprecipitation 

10 and such, all of which are described -in Harlow and Lane, "Using Antibodies: A 
Laboratory Manual", Cold Spring Harbor Press, Cold Spring Harbor, New York, 1999. 

Analysis of the antigen binding capability of each supernatant by ELISA 
identified a population of heavy chain binding proteins recognizing either toxin A or 
toxin B. Approximately 5% of the wells containing transformed insect cells expressed 

15 detectible binding proteins recognizing toxin A and 3% expressed binding proteins 
recognizing toxin B. These results are similar to those of Example 2. 

Example 6 

Expression of Heavy Chain BindinR Protein Genes in Mammalian Cells 

20 

This example illustrates the generation of an array of mammalian cells 
that express functional heavy chain binding proteins. 

The heavy chain and J chain PCR products derived from B cells as 
described above were additionally manipulated to allow for expression in baby hamster 

25 kidney (BHK) cells. In general, the double stranded cDNAs from the selected B cells 
described in Example 1 was engineered to be expressed as human mu isotypes as 
described in Example 6 and was further prepared for insertion into the Sindbis 
expression vector, using methods describe above. The vector (pSinRep5, Invitrogen) 
was therefore engineered to allowed for the expression of heavy and J chains using P SG 

30 promoters and appropriate polyadenylation sequences. The vector therefore was 
potentially capable of co-expression of both a heavy chain and a J chain. 
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The vector population was digested with an appropriate restriction 
enzyme. The resulting recombinant linearized vectors were used to generate capped 
RNA transcripts using the Sindbis Expression System kit. The transcripts were then 
used to infect BHK cells. The supernatant medium from the infected cells containing 
recombinant virus particles was then used identify the expressed heavy chain binding 
proteins. Identification of desired heavy chain can binding proteins used the techniques 
described above. 

Analysis of the antigen binding capability of each supernatant by ELISA 
identified a population of heavy chain binding proteins recognizing either toxin A or 
toxin B. Approximately 4% of the wells containing transformed BHK cells expressed 
detectible binding proteins recognizing toxin A and 3% expressed binding proteins 
recognizing toxin B. These results are similar to those of Example 2. 

From the foregoing, it will be appreciated that, although specific 
embodiments of the invention have been described herein for the purpose of 
illustration, various modifications may be made without deviating from the spirit and 
scope of the invention. 
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Claims 

1. A method for preparing an immunoglobulin binding protein 
5 array in plant cells, comprising the steps of: 

(a) transforming a population of plant cells with a library of at least 
two different polynucleotides encoding different immunoglobulin binding protein 
(IgBP) polypeptides that: 

(i) specifically bind to a ligand with a K D <1 0" 6 moles/liter; or 
10 (ii) form one or more disulfide bonds with one or more 

polypeptides in the transfected cell, to generate a binding protein that 
specifically binds to a ligand with a K D <10" 6 moles/liter; 

wherein the IgBP polypeptides (i) comprise four framework regions 
alternating with three complementarity determining regions and (ii) comprise at least 
1 5 one peptide sequence having at least 75% sequence identity to a framework region of a 
native IgM, IgG, IgA, IgD, IgE, IgY, kappa or lambda immunoglobulin molecule; 
and wherein the IgBP polypeptides are not detectably expressed by the plant cells prior 
to transformation; and 

(b) selecting transformed plant cells, and therefrom preparing an 
20 IgBP array in plant cells. 

2. A method according to claim 1, wherein each IgBP polypeptide 
is a functional IgBP. 



25 3. A method according to claim 1, wherein each IgBP polypeptide 

is an IgBP component that, upon disulfide linkage to one or more IgBP components 
encoded by other polynucleotides in the library, forms a functional IgBP. 

4. A method according to claim 1, further comprising the step of: 
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(c) growing the transformed plant cells on a growth medium that 
supports replication of the plant cells, such that functional IgBPs are assembled by the 
plant cells. 

5 5. A method according to claim 1 , further comprising the steps of: 

(c) growing the transformed plant cells on a growth medium to form 

plants; and 

(d) sexually crossing the plants with themselves or other plants to 
generate progeny, such that the progeny comprise polynucleotides encoding IgBP 

10 components sufficient to form a functional IgBP. 

6. A method according to claim 5, wherein the progeny are seeds. 

7. A method according to claim 5, wherein the progeny are plants 
1 5 or plant cells that assemble functional IgBPs. 

8. A method according to claim 1 , wherein the library comprises at 
least 10 different polynucleotides. 

20 9 - A met «od according to claim 1 , wherein the library comprises at 

least 1 00 different polynucleotides. 

1 0. A method according to claim 1 , wherein the library comprises at 
least 1 000 different polynucleotides. 

25 

1 1 - A method according to claim 1 , wherein the library comprises at 
least 10,000 different polynucleotides. 

12. A method according to claim 3, wherein the components 
30 comprise one or more portions of immunoglobulin molecules selected from the group 
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consisting of heavy chains and fragments thereof, light chains and fragments thereof, J 
chains and secretory components. 

13. A method according to claim I, wherein the polypeptides retain 
5 at least 95% amino acid identity to a framework region of a native IgM, IgG, IgA, IgD, 

IgE, IgY, kappa or lambda immunoglobulin molecule. 

14. A method according to claim 1, wherein the framework regions 

are human. 

10 

15. A method according to claim 1, wherein the framework regions 

are murine. 



16. A method according to claim 1, wherein the step of transforming 
15 is performed via Agrobacterium-mediatcd transformation, chemically-induced DNA 
uptake, electroporation, solid particle intrusion, biolistics, microinjection, 
macroinjection, lipofection or viral infection. 



17. A method according to claim 1, wherein the IgBP polypeptides 
20 are secreted from the plant cells. 

18. A method according to claim 1, wherein the plant cells are 
dicotyledonous plant cells. 

25 19. A method according to claim 18, wherein the plant cells are 

tobacco or Arabidopsis plant cells. 

20. A method according to claim 1, wherein the plant cells are 
monocotyledonous plant cells. 

30 

21. A method according to claim 20, wherein the plant cells are com, 
Lemna or rice plant cells. 
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22. A method according to claim 1 , wherein the plant cells are lower 

plant cells. 

23. A method according to claim 22, wherein the plant cells are 
green algae cells. 



24. A method according to claim 23, wherein the plant cells are 
Chlamydomonas rheirihardtii. 



10 



25. A method for preparing a heavy chain binding protein array in 
eukaryotic cells, comprising the steps of: 

(a) transforming a population of eukaryotic cells with a library of at 
least two different polynucleotides, wherein each polynucleotide encodes a different 
15 heavy chain binding protein (C H BP) polypeptide that: 

(i) comprises an amino acid sequence that is at least 75% 
identical to a constant region tailpiece of a mu or alpha chain . of a native 
immunoglobulin heavy chain; 

(ii) comprises multiple combining sites, wherein all of the 
20 combining sites satisfy the same one of the following requirements: 

(1) at least 75% identity to a 25 consecutive amino acid 
portion of an immunoglobulin light chain variable region; or 

(2) at least 75% identity to a 25 consecutive amino acid 
portion of an immunoglobulin heavy chain variable region; and 

25 ("0 either 0) specifically binds to a ligand with a K D <W 6 

moles/liter; or (2) forms one or more disulfide bonds with one or more polypeptides in 
the transfected cell, to generate a C H BP that specifically binds to a ligand with a K D <10" 
6 moles/liter; and 

(b) growing the transformed cells on a medium that permits 
assembly of C H BPs, wherein each C H BP comprises at least four combining sites; 
and therefrom preparing a C H BP array in eukaryotic cells. 
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26. A method according to claim 25, wherein the polynucleotides 
encode immunoglobulin alpha or mu chains. 

5 27. A method according to claim 25, wherein the cells are further 

transformed with one or more polynucleotides encoding polypeptides having sequences 
that are at least 75% identical to a sequence of an immunoglobulin J chain. 

28. A method according to claim 25, wherein each C H BP is 
1 0 assembled from four alpha chains and one J chain. 

29. A method according to claim 25, wherein each C H BP is 
assembled from twelve mu chains. 

j 15 30. A method according to claim 25, wherein each C H BP is 

assembled from ten mu chain and at least one J chain. 

31. A method according to claim 25, wherein the C H BPs or 
components thereof further comprise one or more portions of immunoglobulin 

20 molecules selected from the group consisting of J chains, secretory components and 
light chain constant regions. 

32. A method according to claim 25, wherein the cells are plant cells. 

25 33. A method according to claim 25, wherein the cells are insect 

cells. 

34. A method according to claim 25, wherein the cells are 
mammalian cells. 

30 
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35. A method for preparing a plant C H BP array, comprising the steps 

of: 

(a) transforming a population of plant cells with a library of at least 
two different polynucleotides, wherein each polynucleotide encodes a different C H BP 

5 component that forms one or more disulfide bonds with one or more polypeptides in the 
transformed cell to generate a C„BP that specifically binds to a ligand with a K D <10" 6 
moles/liter, wherein each component: 

(i) comprises an amino acid sequence that is at least 75% 
identical to a constant region tailpiece of a mu or alpha chain of a native 

10 immunoglobulin heavy chain; and 

(ii) comprises multiple combining sites, wherein all of the 
combining sites satisfy the same one of the following requirements: 

(1) at least 75% identity to a 25 consecutive amino acid 
portion of an immunoglobulin light chain variable region; or 
15 (2) at least 75% identity to a 25 consecutive amino acid 

portion of an immunoglobulin heavy chain variable region; 

(b) growing the transformed plant cells on a growth medium to form 

plants; and 

(c) sexually crossing the plants to generate progeny, such that the 
20 progeny comprise polynucleotides encoding C H BP components sufficient to form a 

functional C H BP that comprises at least four combining sites; 

and therefrom preparing a plant C H BP array. 



25 



36. A method according to claim 35, wherein the progeny are seeds. 

37. A method according to claim 35, wherein the progeny are plants 
or plant cells that assemble functional C„BPs. 



38. A method according to claim 25 or claim 35, wherein the library 
30 comprises at least 10 different polynucleotides. 
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39. A method according to claim 25 or claim 35, wherein the library 
comprises at least 100 different polynucleotides. > 

40. A method according to claim 25 or claim 35, wherein the library 
5 comprises at least 1000 different polynucleotides. 

41. A method according to claim 25 or claim 35, wherein the library 
comprises at least 10,000 different polynucleotides. 

10 42. A method according to claim 25 or claim 35, wherein the 

polynucleotides encode polypeptides that retain at least 95% amino acid identity to a 
contant region tailpiece of a mu or alpha chain of a native immunoglobulin heayy 
chain. 

15 43. A method according to claim 25 or claim 35, wherein the step of 

transforming is performed via Agrobacterium-mediaXzd transformation, chemically- 
induced DNA uptake, electroporation, solid particle intrusion, biolistics, microinjection, 
macroinjection, lipofection or viral infection. 

20 44. A method according to claim 25 or claim 35, wherein the binding 

proteins accumulate in an intracellular compartment of the cells!. 

45. A method according to claim 25 or claim 35, wherein the binding 
proteins are secreted from the cells. 

25 

46. A method according to claim 32 or claim 35, wherein the plant 
cells are dicotyledonous plant cells. 

47. A method according to claim 46, wherein the plant cells are 
30 tobacco or Arabidopsis plant cells. 
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48. A method according to claim 32 or claim 35, wherein the plant 
cells are monocotyledonous plant cells. 

49. A method according to claim 48, wherein the plant cells are corn, 
5 Lemna or rice plant cells. 

50. A method according to claim 32 or claim 35, wherein the plant 
cells are lower plant cells. 



10 



15 



51. A method according to claim 50, wherein the plant cells are 
green algae cells. 

52. A method according to claim 51, wherein the plant cells are 
Chlamydomonas rheinhardtii. 



53. A C H BP array in eukaryotic cells, comprising at least two 
eukaryotic cells that are each transformed with a different polynucleotide encoding at 
least one C H BP polypeptide that: 

(a) comprises an amino acid sequence that is at least 75% identical 
20 to a constant region tailpiece of a mu or alpha chain of a native immunoglobulin heavy 

chain; 

(b) comprises multiple combining sites, wherein all of the 
combining sites satisfy the same one of the following requirements: 

(i) at least 75% identity to a 25 consecutive amino acid 
25 portion of an immunoglobulin light chain variable region; or 

(ii) at least 75% identity to a 25 consecutive amino acid 
portion of an immunoglobulin heavy chain variable region; and 

(c) either (i) specifically binds to a ligand with a K D <1 0" 6 moles/liter; 
or (ii) forms one or more covalent bonds with one or more polypeptides in the 

30 transfected cell, to generate a C H BP that specifically binds to a ligand with a K^IO" 6 
moles/liter; and 

(d) differs in amino acid sequence from other C H BPs in the array; 
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wherein the cells assemble C H BPs comprising at least four combining 

sites. 

54. A binding protein array according to claim 53, wherein the 
5 polynucleotides encode polypeptide components of immunoglobulin molecules 

independently selected from the group consisting of heavy chains and fragments 
thereof, light chains and fragments thereof, J chains and secretory components. 

55. A binding protein array according to claim 53, wherein the cells 

10 are plant cells. 

I 

56. A binding protein array according to claim 53, wherein the cells 
are insect cells. 

15 57. A binding protein array according to claim 53, wherein the cells 

are mammalian cells. 

58. A binding protein array according to claim 55, wherein the plant 
cells are selected from the group consisting of com, rice, Lemna, tobacco and 

20 Chlamydomonas. 

59. A binding protein array according to claim 53, wherein at least 
1 0 different binding proteins are assembled by the cells in the array. 

25 60. A binding protein array according to claim 53, wherein at least 

100 different binding proteins are assembled by the cells in the array. 

61. A binding protein array according to claim 53, wherein at least 
100 dilferent binding proteins are assembled by the cells in the array. 

30 
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62. . A binding protein array according to claim 53, wherein at least 
10,000 different binding proteins are assembled by plant cells in the array. 

63. A binding protein array according to claim 53, wherein each cell 
within the array is transfected with at least two different polynucleotides, each encoding 
a different C H BP component, such that each cell assembles a functional C H BP 
comprising the C H BP components. 

64. A composition comprising an array of encapsulated C H BPs, 
wherein each C H BP: 

(a) comprises an amino acid sequence that is at least 75% identical 
to a constant region tailpiece of a mu or alpha chain of a native immunoglobulin heavy 
chain; 

(b) comprises at least four combining sites, wherein all of the 
•combining sites satisfy the same one of the following requirements: 

(i) at least 75% identity to a 25 consecutive amino acid 
portion of an immunoglobulin light chain variable region; or 

(ii) at least 75% identity to a 25 consecutive amino acid 
portion of an immunoglobulin heavy chain variable region; and 

(c) either (i) specifically binds to a ligand with a K^IO* moles/liter; 
or (ii) forms one or more covalent bonds with one or more polypeptides in a cell, to 
generate a C H BP that specifically binds to a ligand with a K D <1 0" 6 moles/liter; and 

(d) differs in amino acid sequence from other C H BPs in the array. 

65. A method for preparing a heavy chain binding protein array in 
eukaryotic cells, comprising the steps of: 

(a) exposing multiple copies of a polynucleotide encoding a native 
heavy chain to a mutagen, such that random or site-directed mutagenesis of the 
polynucleotide occurs, resulting in a library of heavy chain variants; 
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(b) transforming a population of eukaryotic cells with the library of 
heavy chain variants; and 

(c) growing the transformed cells on a medium that permits 
assembly of C H BPs, wherein each C H BP comprises at least four combining sites; 

5 and therefrom preparing a C H BP array in eukaryotic cells. 

66. A C H BP that: 

(a) comprises an amino acid sequence that is at least 75% identical 
to a constant region tailpiece of a mu or alpha chain of a native immunoglobulin heavy 

10 chain; 

(b) comprises at least four combining sites, wherein all of the 
combining sites satisfy the same one of the following requirements: 

(i) at least 75% identity to a 25 consecutive amino acid 
portion of an immunoglobulin light chain variable region; or 
15 (ii) at least 75% identity to a 25 consecutive amino acid 

portion of an immunoglobulin heavy chain variable region; and 

(c) either (i) specifically binds to a ligand with a K D <10" 6 moles/liter; 
or (ii) forms one or more covalent bonds with one or more polypeptides in a cell, to 
generate a C H BP that specifically binds to a ligand with a K D <10^ moles/liter. 

20 
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SEQUENCE LISTING 

<110> EPlcyte Pharmaceuticals, Inc. 
Hiatt, Andrew C. 
Hein, Mich B. 

<120> IMMUNOGLOBULIN BINDING PROTEIN ARRAYS IN 
EUKARYOTIC CELLS 

<130> 310098.406PC 

<140> PCT/US 
<141> 2001-05-02 

<160> 197 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211>4 
<212> PRT 

<21 3> Homo sapien - Mus muscuJus 
<220> 

<221> VARIANT 
<222> (1)...(4) 

<223> Xaa = Any Amino Acid 
<400> 1 

Phe Gly Xaa Gly 
1 

<210>2 
<211>4 
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<212> PRT 

<213> Homo sapien - Mus musculus 
<220> 

<221> VARIANT 
<222> (1)...(4) 

<223> Xaa = Any Amino Acid 
<400>2 

Cys Xaa Xaa Xaa 
1 

<210>3 
<211>5 
<212> PRT 

<213> Homo sapien - Mus musculus 
<400>3 

Leu Glu Trp lie Gly 
1 5 

<210>4 
<211>4 
<212> PRT 

<213> Homo sapien - Mus musculus 
<220> 

<221> VARIANT 
<222> (1)...(4) 

<223> Xaa = Any Amino Acid 

<400>4 
Trp Gly Xaa Gly 
1 
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<210>5 

<211>14 

<212> PRT 

<21 3> Homo sapien 

<400>5 

Arg Ala Ser Gin Ser Leu Val Ser lie Ser Ser Tyr Leu Ala 
1 5 10 

<210>6 
<211>7 
<212> PRT 
<213> Homo sapien 

<400> 6 

Ala Ala Ser Ser Leu GIu Ser 
1 5 



<210> 7 
<211> 10 
<212> PRT 
<213> Homo sapien 

<400> 7 

Gin Gin Tyr Asn Ser Leu Pro Glu Trp Thr 
1 5 10 



<210>8 
<211> 16 
<212> PRT 
<213> Homo sapien 



<400> 8 

Arg Ser Ser Gin Ser Leu Leu His Ser Asp Gly Asp Thr Tyr Leu Asn 
1 5 10 15 
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<210>9 

<211>7 

<212> PRT 

<213> Homo sapien 

<400>9 

Leu Val Ser Asn Arg Ala Ser 
1 5 

<210> 10 

<211> 8 

<212> PRT 

<213> Homo sapien 

<400> 10 

Met Gin Ala Leu Gin Pro Arg Thr 
1 5 

<210> 11 

<211> 12 

<212> PRT 

<213> Homo sapien 

<400> 11 

Arg Ala Ser Gin Ser Val Ser Ser Ser Tyr Leu Ala 
1 5 10 

<210> 12 

<21i> 7 

<212> PRT 

<213> Homo sapien 

<400> 12 

Gly Ala Ser Ser Arg Ala Thr 
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1 5 

<210> 13 
<211>10 
<212> PRT 
<213> Homosapien 

<400> 13 

Gin Gin Tyr Gly Ser Ser Pro Pro Leu Thr 
1 5 10 

<210> 14 

<211>17 

<212>PRT 

<213> Homo sapien 

<400> 14 

Lys Ser Ser Gin Ser Val Leu Tyr Ser Ser Asn Asn Lys Asn Tyr Leu 
1 5 10 15 

Ala 



<210> 15 

<211>7 

<212> PRT 

<213> Homo sapien 

<400> 15 

Trp Ala Ser Thr Arg Glu Ser 
1 5 

<210> 16 
<211>8 
<212> PRT 
<213> Homo sapien 
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<400> 16 

Gin Gin Tyr Tyr Ser Thr Pro Thr 
1 5 

<210> 17 

<211> 14 

<212> PRT 

<21 3> Homo sapien 

<400> 17 

Ser Gly Ser Ser Ser Asn lie He Gly Asn Asn Tyr Val Ser 
1 5 10 

<210>18 

<211>7 

<212> PRT 

<213> Homo sapien 

<400> 18 

Asp Asn Asn Lys Arg Pro Ser 
1 5 

<210>19 

<211> 14 

<212> PRT 

<213> Homo sapien 

<400> 19 

Ala Thr Trp Asp Asp Ser Leu Ser Ala Asn Ser Ala Pro Val 
1 5 10 

<210> 20 
<211>14 
<212>PRT 
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<213> Homo sapien 
<400> 20 

Thr Gly Thr Ser Ser Asp Val Gly Gly Tyr Asn Ala Vat Ser 
1 5 10 

<210>21 
<211> 7 
<212> PRT 
<213> Homo sapien 

<400> 21 

Asp Val Thr Asp Arg Pro Ser 
1 5 

<210>22 

<211>9 

<212> PRT 

<213> Homo sapien 

<400> 22 

Ser Ser Tyr Gly Gly Gly Ser Asn Val 
1 5 

<210> 23 
<211> 11 
<212> PRT 
<213> Homo sapien 

<400> 23 

Ser Gly Asp Asn Leu Gly Asp Lys Tyr Val His 
1 5 10 

<210>24 
<211> 7 
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<212>PRT 

<213> Homo sapien 

<400> 24 

Asp Asp Asn Lys Arg Pro Ser 
1 5 

<210>25 
<211> 13 
<212>PRT 
<213> Homo sapien 

<400> 25 

Gin Ala Trp Asp Ser Ser Ser Asp His Pro Gly Val Val 
1 5 10 

<210> 26 

<211> 17 

<212> PRT 

<213> Mus musculus 

<400> 26 

Lys Ser Ser Gin Ser Leu Leu Asn Ser Gfy Asn Gfn Lys Asn Tyr Leu 
16 10 15 

Ala 



<210> 27 

<211>7 

<212> PRT 

<213> Mus musculus 

<400> 27 

Trp Ala Ser Thr Arg Glu Ser 
1 5 
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<210> 28 

<211>9 

<212>PRT 

<213> Mus musculus 

<400> 28 

Gin Asn Asp Tyr Ser Tyr Pro Leu Thr 
1 5 

<210> 29 

<211> 16 

<212> PRT 

<213> Mus musculus 

<400> 29 

Arg Ser Ser Gin Ser Leu Val His Ser Asn GJy Asn Thr Tyr Leu Glu 
1 5 10 15 

<210> 30 

<211>7 

<212> PRT 

<213> Mus muscufus 

<400> 30 

Lys Val Ser Asn Arg Phe Ser 
1 5 

<210> 31 

<211> 10 

<212> PRT 

<213> Mus musculus 

<400> 31 

Phe Gin Gly Thr His Val Pro Pro Tyr Thr 
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1 5 10 

<210> 32 

<211>15 

<212> PRT 

<213> Mus musculus 

<400> 32 

Arg Ala Ser Glu Ser Val Asp Ser Tyr Gly Asn Ser Phe Met His 
1 5 10 15 

<210> 33 

<211>7 

<212> PRT 

<213> Mus musculus 

<400> 33 

Ala Ala Ser Asn Leu Glu Ser 
1 5 

<210>34 

<211> 10 

<212> PRT 

<213> Mus musculus . 

<400> 34 

Gin G(n Ser Asn Glu Asp Pro Pro Trp Thr 
1 5 10 

<210> 35 

<211>12 s 

<212> PRT 

<213> Mus musculus 

<400> 35 
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Ser Ala Ser Ser Ser Val Ser Ser Ser Tyr Leu His 
1 5 10 

<210>36 

<211>7 

<212>PRT 

<213> Mus musculus 

<400> 36 

Arg Thr Ser Asn Leu Ala Ser 
1 5 

<210>37 

<211> 10 

<212>PRT 

<213> Mus musculus 

<400> 37 

Gin Gin Trp Ser Ser Tyr Pro Gly Leu Thr 
1 5 10 

<210>38 

<211> 12 

<212> PRT 

<213> Mus musculus 

<400> 38 

Arg Ala Ser Gin Asp Asp He Ser Asn Tyr Leu Asn 
1 5 10 

<210> 39 

<211>7 

<212> PRT 

<213> Mus musculus 
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<400> 39 

Tyr Ala Ser Arg Leu His Ser 
1 5 

<210> 40 

<211> 10 

<212> PRT 

<213> Mus musculus 

<400> 40 

Gin Gin Gly Asn Thr Leu Pro Pro Arg Thr 
1 5 10 

<210> 41 

<211> 10 

<212> PRT 

<213> Mus musculus 

<400> 41 

Ser Ala Ser Ser Ser Val Ser Tyr Met His 
1 5 10 

<210> 42. 

<211>7 

<212> PRT 

<213> Mus musculus 

<400> 42 

Asp Thr Ser Lys Leu Ala Ser 
1 5 

<210> 43 

<211> 12 

<212> PRT 

<213> Mus musculus 
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<400> 43 

Gin Gin Trp Ser Ser Asn Pro Pro Met Pro Leu Thr 
1 5 10 

<210>44 

<211>5 

<212> PRT 

<213> Homo sapien 

<400> 44 
Ser Tyr Ala lie Ser 
1 5 

<210> 45 

<211> 18 

<212>PRT 

<213> Homo sapien 

<400> 45 

Trp lie Asn Pro Tyr Gly Asn Gly Asp Thr Asn Tyr Ala Gin Lys Phe 
15 10 15 

Gin Gly 

<210> 46 

<211>18 

<212> PRT 

<213> Homo sapien 

<400> 46 

Ala Pro Gly Tyr Gly Ser Gly Gly Gly Cys Tyr Arg Gly Asp Tyr Phe 
15 10 15 

Asp Tyr 
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<210> 47 

<211>7 

<212>PRT 

<213> Homo sapien 

<400> 47 

Ser Tyr Gly Trp Ser Trp Asn 
1 5 

<210> 48 

<211> 19 

<212> PRT 

<213> Homo sapien 

<400> 48 

Arg tie Tyr Tyr Arg Ala Tyr Ser Gly Ser Thr Thr Tyr Asn Pro Ser 
15 10 15 

Leu Lys Ser 

<210> 49 

<211> 19 

<212> PRT 

<213> Homo sapien 

<400> 49 l 
Glu Leu Pro Gly Gly Tyr Thr Gly Asp Asp Tyr Tyr Tyr Gly Ser Gly 
15 10 15 

Phe Asp Val 

<210> 50 
<211>5 
<212> PRT 
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<2 1 3> Homo sapien 
<400> 50 

SerTyrAla Met Ser 
1 5 

<210> 51 
<211>19 
<212> PRT 
<213> Homo sapien 

<400> 51 

Val lie Ser Gly Lys Thr Asp Gly Gly Ser Thr Tyr Tyr Ala Asp Ser 
1 5 10 15 

Val Lys Gly 

<210>52 

<211>19 

<212> PRT 

<213> Homo sapien 

<400> 52 

Gly Arg Pro Gly Asp Ser Leu Ser Gly Tyr Tyr Tyr Tyr Tyr His Tyr 
15 10 15 

Phe Asp Tyr 



<21 0> 53 

<211>7 

<212> PRT 

<213> Mus musculus 

<400> 53 

Ser Gly Tyr Trp Asn Asn Ser 
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1 5 

<210>54 

<211>17 

<212> PRT 

<21 3> Mus musculus 

<400>54 

Tyr lie Ser Gly Tyr Ser Gly Ser Thr Tyr Tyr Asn Pro Ser Leu Lys 
15 10 15 

Ser 



<210>55 

<211> 18 

<212> PRT 

<213> Mus musculus 

<400> 55 

Gly Gly Tyr Gly Tyr Gly Tyr Tyr Tyr Tyr Asp Tyr Tyr Tyr Tyr Phe 
1- 5 10 15 

Asp Tyr 



<210>56 

<211>6 

<212> PRT 

<21 3> Mus musculus 

<400> 56 

Asp Tyr Tyr Met Asn Asn 
1 5 

<210>57 
<211>17 
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<212> PRT 

<213> Mus musculus 

<400> 57 

Asp lie Asn Pro Gly Asn Gly Gly Thr Ser Tyr Asn Gin Lys Phe Lys 
1 5 10 15 

Gly 

<210> 58 

<211>16 

<212> PRT 

<213> Mus musculus 

<400> 58 

Gly Ser Tyr Tyr Ser Ser Ser Tyr Met Ala Tyr Tyr Ala Phe Asp Tyr 
15 10 15 

<210>59 

<211>5 

<212> PRT 

<213> Mus musculus 

<400> 59 

Asp Phe Tyr Met Glu 
1 5 

<210> 60 

<211> 19 

<212> PRT 

<213> Mus musculus 

<400> 60 

Ala Ser Arg Asn Lys Ala Asn Asp Tyr Thr Thr Glu Tyr Ser Ala Ser 
15 10 15 
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Val Lys Gly 

<210> 61 

<211>19 

<212> PRT 

<213> Mus musculus 

<400> 61 

Asp Tyr Tyr Tyr Gly Ser Ser Tyr Tyr Glu Gly Pro Val Tyr Trp Tyr 
15 10 15 

Phe Asp Val 

<210> 62 

<211> 18 

<212> PRT 

<213> Homo sapien 

<400> 62 

Pro Thr His Val Asn Val Ser Val Val Met Ala Glu Val Asp Gly Thr 
1 5 10 15 

Cys Tyr 



<210> 63 

<211> 15 

<212> PRT 

<213> Mus musculus 

<400> 63 

Pro Thr Val Asn Val He Met Ser Glu Gly Asp Gly lie Cys Tyr 
1 5 10 15 

<210> 64 
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<211> 18 

<212> PRT 

<213> Mus musculus 

<400> 64 

Pro Thr Asn Val Ser Val Ser Val He Met Ser GIu Gly Asp Gly lie 
1 5 10 15 

Cys Tyr 



<210> 65 
<211> 18 
<212> PRT 

<213> Rattus norvegfcus 
<400> 65 

Pro Thr Asn Val Asn Val Ser Val lie Met Ser GIu Gly Asp Gly lie 
1 5 10 15 

Cys Tyr 

<210> 66 
<211> 18 
<212> PRT 

<213> Oryctolagus cuniculus 
<400> 66 

Pro Thr His Val Asn Val Ser Val Val Val Ala Asp Val GIu Ala Val 
1 5 10 15 

Cys Tyr 



<210> 67 
<211> 18 
<212> PRT 
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<213> Homo sapien 
<400> 67 

Pro Thr Leu Tyr Asn Val Ser Leu Val Met Ser Asp Thr Ala Gly Thr 
15 10 15 

Cys Tyr 



<210>68 

<211> 18 

<212> PRT 

<213> Homo sapien 

<400> 68 

Pro Thr Leu Tyr Asn Val Ser Leu He Met Ser Asp Thr Gly Gly Thr 
1 5 10 15 

Cys Tyr 

<210> 69 
<211> 18 
<212> PRT 

<213> Cricetinae gen. sp. 
<400> 69 

Pro Thr Leu Tyr Asn Val Ser Leu lie Met Ser Asp Ala Gly Gly Thr 
15 10 15 

Cys Tyr 



<210> 70 
<211>18 
<212> PRT 

<213> Cricetinae gen. sp. 
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<400> 70 

Pro Thr Leu Tyr Asn Val Ser Leu Val Leu Ser Asp Thr Ala Giy Glx 
15 10 15 

Cys Tyr 



<210> 71 
<211> 18 
<212> PRT 

<213> Rattus norvegicus 
<400> 71 

Pro Thr Leu Tyr Asn Val Ser Leu lie Met Ser Asp Thr Ala Ser Thr 
1 5 10 15 

Cys Tyr 



<210> 72 
<211> 18 
<212> PRT 
<213> Gallus gallus 



<400> 72 

Pro Ser Phe Val Asn Val Ser Leu Val Leu Met Asp Thr Val Asn Ser 
1 5 10 15 

Cys Asn 



<210> 73 
<211> 19 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> signal sequence for plant expression vector 
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<400> 73 

Met Asp Trp Thr Gly Arg Phe Leu Phe Val Val Ala Ala Ala Thr Gly 
15 10 15 

Val Gin Ser 



<210> 74 

<211>23 

<212> PRT 

<213> Homo sapien 

<400> 74 

Asp lie Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Val Gly 
15 10 15 

Asp Arg Val Thr lie Thr Cys 
20 

<210>75 

<211> 15 

<212>PRT 

<2li3> Homo sapien 

<400> 75 

Trp Tyr Gin Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu lie Tyr 
15 10 15 

<210> 76 

<211> 32 

<212> PRT 

<213> Homo sapien 

<400> 76 

Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr 
15 10 15 
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Leu Thr Me Ser Ser Leu Gin Pro Glu Asp Phe Ala Thr Tyr Tyr Cys 
20 25 30 

<210> 77 
<211> 10 
<212> PRT 
<213> Homo sapien 

<400> 77 

Phe Gly Gin Gly Thr Lys Val Glu He Lys 
1 5 10 

<210> 78 

<211>23 

<212> PRT 

<213> Homo sapien 

<400> 78 

Asp lie Val Met Thr Gin Ser Pro Leu Ser Leu Pro Val Thr Pro Gly 
1 5 10 15 

Glu Pro Ala Ser He Ser Cys 
20 

<210> 79 
<211> 15 
<212> PRT 
<213> Homo sapien 

<400> 79 

Trp Tyr Leu Gin Lys Pro Gly Gin Ser Pro Gin Leu Leu lie Tyr 
1 5 10 15 

<210> 80 
<211>32 
<212>PRT 
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<213> Homo sapien 
<400> 80 

Gly Val Pro Asp Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr 
15 10 15 

Leu Lys lie Ser Arg Val Glu Ala Glu Asp Val Gly Val Tyr Tyr Cys 
20 25 30 

<210> 81 

<211>10 

<212> PRT 

<213> Homo sapien 

<400> 81 

Phe Gly Gin Gly Thr Lys Val Glu lie Lys 
1 5 10 

<210> 82 

<211>23 

<212> PRT 

<213> Homo sapien 

<400> 82 

Glu lie Val Leu Thr Gin Ser Pro Gly Thr Leu Ser Leu Ser Pro Gly 
1 5 10 15 

Glu Arg Ala Thr Leu Ser Cys 
20 

<210> 83 

<211> 15 

<212> PRT 

<213> Homo sapien 

<400> 83 

Trp Tyr Gin Gin Lys Pro Gly Gin Ala Pro Arg Leu Leu He Tyr 
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1 5 10 15 

<210> 84 

<211>32 

<212> PRT 

<213> Homo sapien 

<400> 84 

Gly lie Pro Asp Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr 
1 5 10 15 

Leu Thr He Ser Arg Leu Glu Pro Glu Asp Phe Ala Val Tyr Tyr Cys 
20 25 30 

<210>85 
<211>10 
<212> PRT 
<213> Homo sapien 

<400> 85 

Phe Gly Gin Gly Thr Lys Val Glu lie Lys 
1 5 10 

<210> 86 
<211>23 
<212> PRT 
<213> Homo sapien 

<400> 86 

Asp lie Val Met Thr Gin Ser Pro Asp Ser Leu Ala Val Ser Leu Gly 
1 5 10 15 

Glu Arg Ala Thr lie Asn Cys 
20 

<21 0> 87 
<211> 15 
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<212> PRT 

<213> Homo sapien 

<400> 87 

Trp Tyr G!n Gin Lys Pro Gly Gin Pro Pro Lys Leu Leu He Tyr 
1 5 10 15 

<210>88 

<211>32 

<212>PRT 

<213> Homo sapien 

<400> 88 

Gly Val Pro Asp Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr 
15 10 15 

Leu Thr lie Ser Ser Leu Gin Ala Glu Asp Val Ala Val Tyr Tyr Cys 
20 25 30 

<210>89 

<211> 10 

<212> PRT 

<2 1 3> Homo sapien 

<400> 89 

Phe Gly Gin Gly Thr Lys Val Glu He Lys 
1 5 10 

<210> 90 

<211>22 

<212> PRT 

<213> Homo sapien 

<400> 90 

Gin Ser Val Leu Thr Gin Pro Pro Ser Val Ser Gly Ala Pro Gly Gin 
15 10 15 
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Arg Val Thr lie Ser Cys 
20 

<210>91 

<211> 15 

<212> PRT 

<213> Homo sapien 

<400> 91 

Trp Tyr Gin Gin Leu Pro Gly Thr Ala Pro Lys Leu Leu lie Tyr 
1 5 10 15 

<210> 92 

<211>32 

<212>PRT 

<213> Homo sapien 

<400> 92 

Gly Val Pro Asp Arg Phe Ser Gly Ser Lys Ser Gly Thr Ser Ala Ser 
1 5 10 15 

Leu Ala lie Asn Gly Leu Gin Ser Glu Asp Glu Ala Asp Tyr Tyr Cys 
20 25 30 

<210> 93 

<211> 11 

<212> PRT 

<213> Homo sapien 

<400> 93 

Phe Gly Gly Gly Thr Lys Leu Thr Val Leu Gly 
1 5 10 

<210>94 
<211>22 
<212> PRT 
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<213> Homo sapien 
<400> 94 

Gin Ser Ala Leu Thr Gin Pro Ala Ser Val Ser Gly Ser Pro Gly Gin 
1 5 10 15 

Ser lie Thr lie Ser Cys 
20 

<210>*95 

<211> 15 

<212> PRT 

<213> Homo sapien 

<400> 95 

Trp Tyr Gin Gin His Pro Gly Lys Afa Pro Lys Leu Leu He Tyr 
1 5 10 15 

<210> 96 

<211>32 

<212> PRT 

<213> Homo sapien 

<400> 96 

Gly Val Pro Asp Arg Phe Ser Gly Ser Lys Ser Gly Asn Thr Ala Ser 
1 5 10 15 

Leu Thr lie Ser Gly Leu Gin Ala Glu Asp Glu Ala Asp Tyr Tyr Cys 
20 25 30 

<210> 97 

<211> 11 

<212> PRT 

<213> Homo sapien 

<400> 97 

Phe Gly Gly Gly Thr Lys Val Thr Val Leu Gly 
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1 5 10 

<210> 98 

<211>22 

<212>PRT 

<213> Homo sapien 

<400> 98 

Ser Tyr Glu Leu Thr Gin Pro Pro Ser Val Ser Val Ser Pro Gly G/n 
1 5 10 is 

Thr Ala Arg He Thr Cys 
20 

<210> 99 

<211> 15 

<212> PRT 

<2 1 3> Homo sapien 

<400> 99 

Trp Tyr Gin Gin Lys Pro Gly Gin Ala Pro Val Leu Val lie Tyr 
1 5 10 15 

<210> 100 

<211>32 

<212> PRT 

<213> Homo sapien 

<400> 100 

Gly lie Pro Glu Arg Phe Ser Gly Ser Asn Ser Gly Asn Thr Ala Thr 
1 5 10 15 

Leu Thr lie Ser Gly Val Gin Ala Gly Asp Glu Ala Asp Tyr Tyr Cys 
20 25 30 

<210> 101 
<211> 11 
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<212> PRT 

<213> Homo sapien 

<400> 101 

Phe Gly Gly Giy Thr Lys Leu Thr Va! Leu Gly 
1 5 10 

<210> 102 

<211>21 

<212> PRT 

<213> Homo sapien 

<400> 102 

Ser Glu Leu Thr Gin Pro Pro Ser Val Ser Val Ala Leu Gly Gin Thr 
1 5 10 15 

Va! Arg lie Thr Cys 
20 

<210> 103 
<211> 15 
<212> PRT 
<213> Homo sapien 

<400> 103 

Trp Tyr Gin Gin Lys Pro Gly Gin Ala Pro Leu Leu Val lie Tyr 
1 5 10 15 

<210> 104 

<211>32 

<212> PRT 

<213> Homo sapien 

<400> 104 

Gly He Pro Asp Arg Phe Ser Gly Ser Ser Ser Gly His Thr Ala Ser 
15 10 15 
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Leu Thr lie Thr Gly Ala Gin Ala Glu Asp Glu Ala Asp Tyr Tyr Cys 
20 25 30 

<210> 105 
<211> 11 
<212> PRT 
<213> Homo sapien 

<400> 105 

Phe Gly Gly Gly Thr Lys Leu Thr Val Leu Gly 
1 5 10 

<210> 106 

<211>21 

<212> PRT 

<213> Homo sapien 

<400> 106 

Ser Ala Leu Thr Gfn Pro Pro Ser Ala Ser Gly Ser Leu Gly Gin Ser 
1 5 10 15 

Val Thr lie Ser Cys 
20 

<210> 107 

<211>15 

<212> PRT 

<213> Homo sapien 

<400> 107 

Trp Tyr Gin Gin His Pro Gly Arg Ala Pro Lys Leu Val He Phe 
1 5 10 15 

<210> 108 
<211>32 
<212> PRT 
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<213> Homo sapien 
<400> 108 

Gly Val Pro Asp Arg Phe Ser Gly Ser Lys Ser Asp Gin Thr Ala Ser 
15 10 15 

Leu Thr Vai Ser Gly Leu Arg Ala Glu Asp Glu Ala Asp Tyr Tyr Cys 
20 25 30 

<210> 109 
<211> 11 
<212>PRT 
<213> Homo sapien 

<400> 109 

Phe Gly Gly Gly Thr Lys Leu Thr Val Leu Arg 
1 5 10 

<210> 110 

<211>22 

<212> PRT 

<21 3> Homo sapien 

<400> 110 

Asn Phe Met Leu Thr Gin Pro His Ser Val Ser Glu Ser Pro Gly Lys 
15 10 15 

Thr Val Thr He Ser Cys 
20 

<210> 111 
<211> 15 
<212> PRT 
<213> Homo sapien 

<400> 111 

Trp Tyr Gin Gin Arg Pro Gly Ser Ala Pro Thr Thr Val He Tyr 
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1 5 10 15 

<210> 112 

<211>31 

<212> PRT 

<213> Homo sapien 

<400> 112 

Gly Val Pro Asp Arg Phe Ser Gly Ser Ser Ser Asn Ser Ala Ser Leu 
1 5 10 15 

Thr He Ser Gly Leu Lys Thr Glu Asp Glu Ala Asp Tyr Tyr Cys 
20 25 30 

<210> 113 

<211>11 

<212> PRT 

<213> Homo sapien 

<400>113 

Phe Gly Gly Gly Thr Lys Leu Thr Val Leu Gly 
1 5 10 

<210> 114 

<211>23 

<212> PRT 

<213> Mus musculus 

<400> 114 

Asp lie Val Met Thr Gin Ser Pro Ser Ser Leu Ala Val Ser Ala Gly 
1 5 10 15 

Glu Lys Val Thr Met Ser Cys 
20 

<210>115 
<211>15 
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<212> PRT 

<213> Mus musculus 

<400>115 

Trp Tyr Gin Gin Lys Pro Gly Gin Ser Pro Lys Leu Leu He Tyr 
1 5 10 15 

<210>116 

<211> 32 

<212> PRT 

<213> Mus musculus 

<400> 116 

Gly Val Pro Asp Arg Phe Thr Gly Ser Gly Ser Gly Thr Asp Phe Thr 
15 10 15 

Leu Thr lie Ser Ser Val Gin Ala Glu Asp Leu Ala Val Tyr Tyr Cys 
20 25 30 

<210> 117 

<211> 10 

<212> PRT 

<213> Mus musculus 

<400> 117 

Phe Gly Ala Gly Thr Lys Leu Glu Leu Lys 
1 5 10 

<210> 118 

<211>23 

<212> PRT 

<213> Mus musculus 

<400> 118 

Asp Val Val Met Thr Gin Thr Pro Leu Ser Leu Pro Val Ser Leu Gly 
15 10 15 
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Asp Gin Ala Ser lie Ser Cys 
20 

<210> 119 

<211> 15 

<212> PRT 

<21 3> Mus musculus 

<400> 119 

Tip Tyr Leu Gin Lys Pro Gly Gin Ser Pro Lys Leu Leu He Tyr 
1 5 10 15 

<210> 120 

<211> 32 

<212> PRT 

<213> Mus musculus 

<400> 120 

Gly Val Pro Asp Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr 
1 5 10 15 

Leu Lys lie Ser Arg Val Glu Ala Glu Asp Leu Gly Val Tyr Tyr Cys 
20 25 30 

<210> 121 

<211> 10 

<212> PRT 

<213> Mus musculus 

<400> 121 

Phe Gly Gly Gly Thr Lys Leu Glu lie Lys 
1 5 10 

<210> 122 
<211>23 
<212> PRT 
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<213> Mus musculus 
<400> 122 

Asp lie Val Leu Thr Gin Ser Pro Ala Ser Leu Ala Val Ser Leu Gly 
1 5 10 15 

Gin Arg Ala Thr lie Ser Cys 
20 

<210> 123 

<211>15 

<212> PRT 

<21 3> Mus musculus 

<400> 123 

Trp Tyr Gin Gin Lys Pro Gly Gin Pro Pro Lys Leu Leu lie Tyr 
15 10 15 

<210> 124 

<211>32 

<212> PRT 

<213> Mus musculus 

<400> 124 

Gly Val Pro Ala Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr 
1 5 10 15 

Leu Asn lie His Pro Val Glu Glu Asp Asp Ala Ala Thr Tyr Tyr Cys 
20 25 . 30 

<210> 125 

<211> 10 

<212> PRT 

<213> Mus musculus 

<400> 125 

Phe Gly Gly Gly Thr Lys Leu Glu lie Lys 
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10 



<210> 126 

<211>23 

<212> PRT 

<213> Mus musculus 

<400> 126 

Glu He Val Leu Thr Gin Ser Pro Ala lie Met Ser Ala Ser Pro Gly 
1 5 10 15 

Glu Lys Val Thr Met Thr Cys 
20 

<210> 127 
<211> 15 
<212> PRT 
<213> Mus musculus 

<400> 127 

Trp Tyr Gin Gin Lys Pro Gly Asn Ser Pro Lys Leu Trp lie Tyr 
1 5 10 is 

<210> 128 
<211> 32 
<212> PRT 
<213> Mus musculus 

<400> 128 

Gly Val Pro Ala Arg Phe Ser Gly Ser Gly Ser Gly Thr Ser Tyr Ser 
1 5 10 15 

Leu Thr He Ser Ser Met Glu Ala Glu Asp Ala Ala Thr Tyr Tyr Cys 
20 25 30 

<210> 129 
<211> 10 
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<212> PRT 

<213> Mus musculus 

<400> 129 

Phe Gly Ala Gly Thr Lys Leu Glu lie Lys 
1 5 10 

<210> 130 

<211>23 

<212> PRT 

<21 3> Mus musculus 

<400> 130 

Asp lie Gin Met Thr Gin Ser Pro Ser Ser Leu Ser Ala Ser Leu Gly 
15 10 15 

Asp Arg Val Thr lie Thr Cys 
20 

<210> 131 

<211> 15 

<212> PRT 

<213> Mus musculus 

<400> 131 

Trp Tyr Gin Gin Lys Pro Gly Gly Ser Pro Lys Leu Leu He Tyr 
1 5 10 15 

<210> 132 

<211>32 

<212> PRT 

<213> Mus musculus 

<400> 132 

Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Tyr Ser 
1 5 10 15 
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Leu Thr lie Ser Gin Leu Glu Gin Glu Asp lie Ala Thr Tyr Phe Cys 
20 25 30 

<210>133 

<211>10 

<212> PRT 

<213> Mus musculus 

<400> 133 

Phe Gly Gly Gly Thr Lys Leu Glu lie Lys 
1 5 10 

<210> 134 

<211>23 

<212> PRT 

<213> Mus musculus 

<400> 134 

Gin lie Val Leu Thr Gin Ser Pro Ala lie Met Ser Ala Ser Pro Gly 
1 5 10 15 

Glu Lys Val Thr Met Thr Cys 
20 

<210> 135 
<211> 15 
<212> PRT 
<213> Mus musculus 

<400> 135 

Trp Tyr Gin Gin Lys Ser Gly Thr Ser Pro Lys Arg Trp lie Tyr 
1 5 10 15 

<210> 136 
<211>32 
<212> PRT 
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<213> Mus musculus 
<400> 136 

Gly Val Pro Ala Arg Phe Ser Gly Ser Gly Ser Gly Thr Ser Tyr Ser 
15 10 15 

Leu Thr lie Ser Ser Met GIu Ala Glu Asp Ala Ala Thr Tyr Tyr Cys 
20 25 30 

<210> 137 

<211> 10 

<212> PRT 

<213> Mus musculus 

<400> 137 

Phe Gly Ala Gly Thr Lys Leu Glu Leu Lys 
1 5 10 

<210> 138 

<211>22 

<212> PRT 

<21 3> Mus musculus 

<400> 138 

Gin Ala Val Val Thr Gin Glu Ser Ala Leu Thr Thr Ser Pro Gly Glu 
1 5 10 15 

Thr Val Thr Leu Thr Cys 
20 

<210> 139 

<211> 15 

<212> PRT 

<21 3> Mus musculus 

<400> 139 

Trp Val Gin Glu Lys Pro Asp His Leu Phe Thr Gly Leu lie Gly 
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<210> 140 

<211> 32 

<212> PRT 

<213> Mus musculus 



<400> 140 

Gly Val Pro Ala Arg Phe Ser Gly Ser Leu lie Gly Asp Lys Ala Afa 
1 5 10 15 

Leu Thr lie Thr Gly Ala Gin Thr Glu Asp Glu Ala lie Tyr Phe Cys 
20 25 30 



<210> 141 

<211> 11 

<212> PRT 

<213> Mus musculus 

<400> 141 

Phe Gly Gly Gly Thr Lys Leu Thr Val Leu Gly 
1 5 10 



<210>142 

<211>30 

<212> PRT 

<213> Homo sapien 



<400> 142 

Gin Val Gin Leu Val Gin Ser Gly Ala Glu Val Lys Lys Pro Gly Ala 
1 5 10 15 

Ser Val Lys Val Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr 
20 25 30 



<210> 143 
<211> 14 
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<212>PRT 

<213> Homo sapien 

<400> 143 

Trp Val Arg Gly Ala Pro Gly Gin Gly Leu Glu Trp Met Gly 
1 5 10 

<210> 144 

<211> 32 

<212> PRT 

<21 3> Homo sapien 

<400> 144 

Arg Val Thr lie Thr Ala Asp Thr Ser Thr Ser Thr Ala Tyr Met Glu 
1 5 10 15 

Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val Tyr Tyr Cys Ala Arg 
20 25 30 

<210> 145 

<211> 11 

<212> PRT 

<21 3> Homo sapien 

<400> 145 

Tip Gly Gin Gly Thr Leu Val Thr Val Ser Ser 
1 5 10 

<210> 146 

<211>30 

<212>PRT 

<213> Homo sapien 

<400> 146 

Gin Val Gin Leu Gin Glu Ser Gly Pro Gly Leu Val Lys Pro Ser Gin 
1 5 10 15 
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Thr Leu Ser Leu Thr Cys Thr Val Ser Gly Gly Ser Val Ser 
20 25 30 

<210> 147 
<211> 14 
<212>PRT 
<213> Homo sapien 

<400> 147 

Trp lie Arg Gin Pro Pro Gly Lys Gly Leu Glu Trp lie Gly 
1 5 10 

<210> 148 

<211>32 

<212> PRT 

<213> Homo sapien 

<400> 148 

Arg Val Thr He Ser Val Asp Thr Ser Lys Asn Gin Phe Ser Leu Lys 
1 5 10 15 

Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Val Tyr Tyr Cys Ala Arg 
20 25 30 

<210>149 
<211> 11 
<212> PRT 
<213> Homo sapien 

<400> 149 

Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ser 
1 5 10 

<210> 150 
<211>30 
<212> PRT 
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<213> Homo sapien 
<400> 150 

Glu Val Gin Leu Val Glu Ser Gly GJy Gly Leu Va) GJn Pro Gly Gly 
15 10 15 

Ser Leu Arg Leu Ser Cys Ala Ala Cys Gly Phe Thr Phe Ser 
20 25 30 

<210> 151 

<211>14 

<212> PRT 

<213> Homo sapien 

<400> 151 

Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Ser 
1 5 10 

<210> 152 

<211>32 

<212> PRT 

<21 3> Homo sapien 

<400> 152 

Arg Phe Thr He Ser Arg Asp Asn Ser Lys Asn Thr Leu Tyr Leu Gin 
1 5 10 15 

Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala Arg 
20 25 30 

<210> 153 
<211> 11 
<212> PRT 
<213> Homo sapien 

<400> 153 

Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ser 
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<210> 154 

<211>30 

<212> PRT 

<213> Mus musculus 



<400> 154 

Glu Val Gin Leu Gin Glu Ser Gly Pro Ser Leu Val Lys Pro Ser Gin 
1 5 10 15 

Thr Leu Ser Leu Thr Cys Ser Val Thr Gly Asp Ser lie Thr 
20 25 30 



<210> 155 
<211> 14 
<212> PRT 
<213> Mus musculus 



<400> 155 

Trp lie Arg Gin Phe Pro Gly Asn Lys Leu Glu Trp Met Gly 
1 5 10 

-\ 

<210> 156 

<211>32 

<212>PRT 

<213> Mus musculus 

<400> 156 

Arg lie Ser He Thr Arg Asp Thr Ser Lys Asn Gin Tyr Phe Leu Gin 
1 5 10 15 

Leu Asn Ser Val Thr Thr Glu Asp Thr Ala Thr Tyr Tyr Cys Ala Arg 
20 25 30 

<210> 157 
<211>11 
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<212> PRT 

<21 3> Mus musculus 

<400> 157 

Trp Gly Gfn Gly Thr Thr Val Thr Va! Ser Ser 
1 5 10 

<210> 158 

<211>30 

<212> PRT 

<213> Mus musculus 

<400> 158 

Gin Val Gin Leu Lys Glu Ser Gly Pro Gly Leu Val Ala Pro Ser Gin 
15 10 15 

Ser Leu Ser He Thr Cys Thr Val Ser Gly Phe Ser Leu Thr 
20 25 30 

<210>159 

<211> 14 

<212> PRT 

<21 3> Mus musculus 

<400> 159 

Trp Val Arg Gin Pro Pro Gly Lys Gly Leu Glu Trp Leu Gly 
1 5 10 

<210> 160 

<211>32 

<212> PRT 

<21 3> Mus musculus 

<400> 160 

Arg Leu Ser Me Ser Lys Asp Gin Ser Lys Ser Gin Val Phe Leu Lys 
1 5 10 15 
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Met Asn Ser Leu Gin Thr Asp Asp Thr Ala Met Tyr Tyr Cys Ala Arg 
20 25 30 

<210> 161 

<211> 11 

<212> PRT 

<213> Mus musculus 

<400> 161 

Trp Gly Gin Gly Thr Ser Val Thr Val Ser Ser 
1 5 10 

<210> 162 

<211>30 

<212> PRT 

<213> Mus musculus 

<400> 162 

Glu Val Gin Leu Gin Gin Ser Gly Pro Glu Leu Val Lys Pro Gly Ala 
1 5 10 15 

Ser Val Lys He Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr 
20 25 30 

<210> 163 

<211> 14 

<212> PRT 

<213> Mus musculus 

<400> 163 

Trp Val Lys Gin Ser Pro Gly Lys Ser Leu Glu Trp He Gly 
1 5 10 

<210> 164 
<211>32 
<212> PRT 
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<213> Mus musculus 
<400> 164 

Lys Ala Thr Leu Thr Val Asp Lys Ser Ser Ser Thr Ala Tyr Met Gin 
1 5 10 . 15 

Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Tyr Cys Ala Arg 
20 25 30 

<210> 165 

<211>11 

<212> PRT 

<213> Mus musculus 

<400> 165 

Trp Gly Gin Gly Thr Thr Val Thr Val Ser Ser 
1 5 10 

<210> 166 

<211>30 

<212> PRT 

<21 3> Mus musculus 

<400> 166 

Gin Val Gin Leu Gin Gin Pro Gly Ala Glu Leu Val Lys Pro Gly Ala 
15 10 15 

Ser Val Lys Leu Ser Cys Lys Ala Ser Gly Tyr Thr Pro Thr 
20 25 30 

<210> 167 
<211> 14 
<212> PRT 

<213> Mus musculus • 
<400> 167 

Trp Val Lys Gin Arg Pro Gly Gin Gly Leu Glu Trp lie Gly 
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<210>168 

<211>32 

<212> PRT 

<213> Mus musculus 

<400> 168 

Lys Afa Thr Leu Thr Val Asp Lys Ser Ser Ser Thr Ala Tyr Met Gin 
1 5 10 15 

Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Tyr Cys Ala Arg 
20 25 30 

<210>169 

<211>11 

<212> PRT 

<213> Mus musculus 

<400> 169 

Trp Val Gin Gly Thr Thr Val Thr Val Ser Ser 
1 5 10 

<210> 170 

<211>30 

<212> PRT 

<213> Mus musculus 

<400> 170 

Glu Val Gin Leu Gin Gin Ser Gly Ala Gfu Leu Val Lys Pro Gly Ala 
1 5 10 ' 15 

Ser Val Lys Leu Ser Cys Thr Ala Ser Gly Phe Asn lie Lys 
20 25 30 

<210> 171 
<211> 14 
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<212>PRT 

<213> Mus musculus 

<400> 171 

Trp Val Lys Gin Arg Pro Glu Gin Gly Leu Glu Tip He Gly 
1 5 10 

<210> 172 

<211>32 

<212> PRT 

<213> Mus musculus 

<400>172 

Lys Ala Thr He Thr Ala Asp Thr Ser Ser Asn Thr Ala Tyr Leu Gin 
15 10 15 

Leu Ser Ser Leu Thr Ser Glu Asp Thr Ala Val Tyr Tyr Cys Ala Arg 
20 25 30 

<210>173 
<211> 11 
<212> PRT 
<213> Mus musculus 

<400> 173 

Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ser 
1 5 10 

<210> 174 

<211>30 

<212> PRT 

<213> Mus musculus 

<400> 174 

Glu Val Lys Leu Val Glu Ser Gly Gly Gly Leu Val Gin Pro Gly Gly 
1 5 10 15 
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Ser Leu Arg Leu Ser Cys Ala Thr Ser Gly Phe Thr Phe Ser 
20 25 30 

<210>175 
<211> 14 
<212>PRT 
<213> Mus musculus 

<400> 175 

Trp Val Arg Gin Pro Pro Gly Lys Ala Leu Glu Trp He AJa 
15 10 

<210> 176 

<211> 32 

<212> PRT 

<213> Mus musculus 

<400> 176 

Arg Phe Thr Val Ser Arg Asp Thr Ser Gin Ser lie lie Tyr Leu Gin 
1 5 10 15 

Met Asn Ala Leu Arg Ala Glu Asp Thr Ala lie Tyr Tyr Cys Ala Arg 
20 25 30 

<210>177 

<211>11 

<212> PRT 

<213> Mus musculus 

<400> 177 

Trp Gly Ala Gly Thr Thr Val Thr Val Ser Ser 
1 5 10 

<210> 178 
<211>30 
<212> PRT 
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<213> Mus musculus 
<400> 178 

Glu Val Lys Leu Leu Glu Ser Giy Gly Gly Leu Val Gin Pro Gly Gly 
15 10 15 

Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Phe Asp Phe Ser 
20 25 30 

<210> 179 

<211>14 

<212> PRT 

<213> Mus musculus 

<400> 179 

Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp lie Gly 
1 5 10 

<210> 180 

<211>32 

<212> PRT 

<21 3> Mus musculus 

<400> 180 

Lys Phe Me lie Ser Arg Asp Gin Ala Lys Gin Thr Leu Tyr Leu Gin 
15 10 15 

Met Ser Lys Val Arg Ser Glu Asp Thr Ala Leu Tyr Tyr Cys Ala Arg 
20 25 30 

<210> 181 

<211> 11 

<212> PRT 

<213> Mus musculus 

<400> 181 

Trp Gly Gin Gly Thr Thr Val Thr Val Ser Ser 
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1 5 10 

<210> 182 

<211>30 

<212> PRT 

<21 3> Mus musculus 

<400> 182 

Giu Val Lys Leu Glu Glu Ser Gly Gly Gly Leu Val Gin Pro Gly Gly 
1 5 10 15 

Ser Met Lys Leu Ser Cys Val Ala Ser Gly Phe Thr Phe Ser 
20 25 30 

<210> 183 

<211>14 

<212> PRT 

<213> Mus musculus 

<400> 183 

Trp Val Arg Gin Ser Pro Glu Lys Gly Leu Glu Tip Val Ala 
1 5 10 

<210> 184 

<211>32 

<212> PRT 

<213> Mus musculus 

<400> 184 

Arg Phe Thr He Ser Arg Asp Asp Ser Lys Ser Ser Val Tyr Leu Gin 
15 10 15 

Met Asn Asn Leu Arg Ala Glu Asp Thr Gly lie Tyr Tyr Cys Thr Thr 
20 25 30 

<210> 185 
<211> 11 
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<212> PRT 

<213> Mus musculus 

<400> 185 

Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ser 
1 5 10 

<210> 186 

<211>30 

<212> PRT 

<21 3> Mus musculus 

<400> 186 

Glu Val Gin Leu Val Glu Ser Gly Gly Gly Leu Val Lys Pro Gly Gly 
1 5 10 15 

Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser 
20 25 30 

<210>187 

<211> 14 

<212> PRT 

<213> Mus musculus 

<400> 187 

Trp Val Arg Gin Thr Pro Glu Lys Arg Leu Glu Trp Val Ala 
1 5 10 

<210> 188 

<211> 32 

<212> PRT 

<213> Mus musculus 

<400> 188 

Arg Phe Thr lie Ser Arg Asp Gin Ala Lys Gin Thr Leu Tyr Leu Gin 
15 10 15 
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Met Ser Ser Leu Arg Ser Glu Asp Thr Ala Met Tyr Tyr Cys Ala Arg 
20 25 30 

<210>189 

<211> 11 

<212> PRT 

<213> Mus musculus 

<400> 189 

Trp Gly Gin Gly Thr Ser Val Thr Val Ser Ser 
1 5 10 

<210> 190 

<211>30 

<212> PRT 

<213> Mus musculus 

<400> 190 

Glu Val Gin Leu Gin Gin Ser Gly Ala Glu Leu Val Arg Ala Gly Ser 
1 5 10 15 

Ser Val Lys Met Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr 
20 25 30 

<210> 191 

<211>14 

<212> PRT 

<213> Mus musculus 

<400> 191 

Trp Val Lys Gin Arg Pro Gly Gin Gly Leu Glu Trp Me Gly 
1 5 10 

<210> 192 
<211>32 
<212> PRT 
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<213> Mus musculus 
<400> 192 

Lys Thr Thr Leu Thr Val Asp Lys Ser Ser Ser Thr Ala Tyr Met G!n 
1 5 10 15 

Leu Arg Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Phe Cys Ala Arg 
20 25 30 

<210> 193 

<211> 11 

<212> PRT 

<213> Mus musculus 

<400> 193 

Trp Gly Gin Gly Thr Thr Leu Thr Val Ser Ser 
1 5 10 

<210> 194 

<211>31 

<212> PRT 

<213> Mus musculus 

<400> 194 

Ser Glu Val Gin Leu Val Glu Ser Gly Gly Gly Leu VaJ Lys Pro Gly 
15 10 15 

Gly Ser Val Lys Leu Ser Cys Lys Ala Ser Gly Phe Thr Phe Ser 
20 25 30 

<210> 195 
<211> 14 
<212> PRT 
<213> Mus musculus 

<400> 195 

Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Gly 
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<210> 196 

<211>32 

<212>PRT 

<213> Mus museums 

<400> 196 

Arg Phe Thr He Ser Arg Asp Asn Ser Lys Ser Thr Leu Tyr Leu Gin 
1 5 10 15 

Met Ser Ser Leu Arg Ser Glu Asp Thr Ala Thr Tyr Tyr Cys Ala Arg 
20 25 30 

<210> 197 

<211> 11 

<212> PRT 

<213> Mus musculus 



<400> 197 

Trp Gly Gin Gly Thr Thr Val Thr Val Ser Ser 
1 5 10 
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